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EDITOR'S INTRODUCTION 

Psychologists have been ardent professionals, an eager, 
easily converted lot. No wonder the cry is often heard among 
them "There is news in the land of Babel," meaning "Here 
now is the psychology!" So it has come about that there are 
mechanisms of automatic defense against the asserted exclu- 
siveness and the propaganda of behavior theorists. Chief 
among them is undoubtedly negative adaptation or, as the 
authors of this book would have it, with greater illumination, 
"absence of reinforcement." 

I grant the serviceableness of such defenses in preserving 
common sense and healthy skepticism, yet I am sorry for the 
psychologist who misses this out-of-the-ordinary textbook. 
He may be one whose own work lies far afield. But no matter 
what that work may be, it would enhance his vision and build 
his morale to know that it has been possible already to dem- 
onstrate, operationally and therefore beyond challenge, so 
much lawfulness of behavior on the single assumption that 
all the features of learned behavior are but the routes, straight 
routes and detours, down which an organism has been baited. 
He might quibble over the excessive use of rats and balk at 
the extrapolations to higher behavior, but he could not deny 
massive facts that stick. 

I especially congratulate you, the thoughtful student, whose 
first or early exposure to psychology is through this book. Its 
use as a text is a guarantee that you have an instructor who 
knows that the basis of every science lies not in talk and proof 
by say-so, but in experimental methods. At best you are going 
to learn psychological science by your own sciencing, in a 
laboratory. If circumstances deny that privilege, your instruc- 
tor will still see to it that you get the next best by perfectly 
feasible demonstrations in the classroom. Finally, if this book 



vi EDITOR'S INTRODUCTION 

arouses in you the tingling enthusiasm that in an earlier form 
it has plainly evoked in many students, you are on your way 
to insights of the greatest value. They will be of use to you 
whether you become a psychologist, teacher, lawyer, sales- 
man, philosopher, doctor, or just a person who feels the need 
to see beneath the seeming chanciness of human behavior. 

RICHARD M. ELLIOTT 



PREFACE 

This book is a new kind of introduction to psychology. It 
is different in that it represents for the first time a point of 
view that is coming to guide the thinking and research of an 
active group of psychologists in this country. The members 
of this group are mainly experimentalists, laboratory workers, 
who spend much of their time in observing and measuring 
the behavior of organisms rats, dogs, guinea-pigs, apes, 
pigeons, and, of course, human beings. They are unflaggingly 
on the lookout for fundamental principles of behavior- 
principles that hold true for the white rat as well as the 
college student, for the dog in laboratory harness as well as the 
patient on the psychoanalyst's couch, for the tribal savage as 
well as the sophisticated product of our own culture. Already 
they have discovered some of these principles and have 
brought them together in the beginnings of scientific theory. 
Other principles are, at present, only suspected, and the 
search goes on at an ever faster pace. In this book, we try to 
tell about the ones of which we are certain; we describe some 
of the research they are based on; and we point out the way 
in which they may be organized to give a meaningful picture 
of human conduct. We hope that something of interest and 
use, perhaps even something of adventure, will be found in 
our account. 

This has not been an easy book to write. Our aim, at the 
outset, was clear enough: we wanted to construct an elemen- 
tary text in psychology that would be suitable for our own 
use and, hopefully, for the use of a rapidly growing number 
of like-minded teachers. We had felt, for some time, the need 
for a book that would integrate classical and contemporary 
thought in a way that would adequately represent the dom- 
inant theoretical trend of today. But when, at last, we under- 
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took to write it ourselves, we soon became aware of the 
difficulties involved. We had no models to work from, no 
tradition to follow, at the undergraduate level of exposition. 
With respect to the content as well as the form of our text, 
we had to rely upon our own judgment, again and again 
often with misgivings. 

We found, too, that several readers had to be kept before 
us as we wrote. There was the beginning student, the object 
of our greatest concern, who might never go beyond an intro- 
ductory course. There was the advanced undergraduate, even 
the graduate student, who had not yet been acquainted with 
our way of thinking. And, finally, there were our colleagues, 
watching us to see how well we would be able to carry rein- 
forcement theory to the student. 

Now, with our book in the hands of the publisher, we are 
well aware that it reflects our struggle with these problems, 
rather than their complete solution. No one could think 
otherwise. We will have to be content, at present, with some- 
thing less than the loaf we intended to put on your table. On 
the other hand, we think there is something nutritious in it. 
How well it has been kneaded, and how well baked, the 
reader must judge. 

All books are indebted to more people, for more things, 
than the authors can remember. At this juncture, we recall 
most clearly Richard M. Elliott, whose confidence in our 
project never faltered and who was most generous with his 
reinforcements; Thomas W. Reese and Kenneth MacCor- 
quodale who, with gentle painstaking, rooted out errors and 
added their thoughts to our own at many points; and Murray 
Sidman and James A. Dinsmoor who, with great kindness, 
took up the double burden of illustration and indexing. More 
than by any other man, we have been guided and inspired by 
the work and thought of Burrhus F. Skinner, who led us into 
the field of behavior theory, and whose influence has so per- 
fused this field as to be lost to a proper contemporary perspec- 
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live. We are aware, too, of how much we owe to our own 
studentsthose men and women, working alongside us~ 
whose vitality and creativity have been a recurrent source of 
our own enthusiasm. 

F.S.K. 

W.N.S. 



Pour atteindre a la verit, il faut une fois dans 
sa vie se dfaire de toutes les opinions que Ton a 
revues et reconstruire de nouveau, et ds le fonde- 
ment, tous les systemes de ses connaissances. 

Rene Descartes 
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PSYCHOLOGY AND THE REFLEX 

NATURE WITH its myriad phenomena assumes a unified aspect only in the 
rarest cases; in the majority of instances it exhibits a thoroughly com- 
posite character. . . ; it is accordingly one of the duties of science to 
conceive phenomena as made up of sets of partial phenomena, and at 
first to study these partial phenomena in their purity. 

P. Volkmann, Erkenntnistheoretische 

Grundzuge der Naturwissenschaft, 1896 

Preliminary 

The purpose of this text is threefold: (i) to acquaint you 
with a number of well-established psychological principles; 
(2) to show you how these principles are related, one to the 
other; and (3) to suggest how you may apply them in the 
analysis of everyday human activity. In addition, we hope that 
you will draw from it some notion of where the frontiers of 
our science lie, of what important problems remain to be 
solved, and, perhaps, an idea of how you can help in further- 
ing their solution. 

With certain matters of theory and practice commonly 
considered in elementary texts, we shall not be concerned. 
You will find in this book no chapters on the workings of the 
human nervous system, the structure and function of our 
sense organs, or the measurement of intelligence. These are 
interesting matters for advanced study, but they do not rightly 
belong within a text devoted to fundamental problems and 
basic principles of psychology. 

For the general reader, there are excellent surveys and 
samplings of the various fields and problems of psychology, 
which provide a bird's-eye view of the science. For the ad- 
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vanced student, there are many books that deal exclusively 
with this or that area of specialized interest. The present text 
belongs within neither category. It aims simply to furnish 
the beginning student with a solid foundation and a trust- 
worthy framework for thinking about, or dealing with, human 
nature as he finds it in his daily life. And, needless to say, it 
should also prepare him for further, more intensive, study 
in our field. 

As you make your way, step by step, through this book, you 
will notice certain outstanding characteristics of our approach. 
These may well be mentioned in advance, as a preparation 
for things to come. Our approach is biological, experimental, 
and systematic. Biological, in that our basic principles will 
often be drawn from the study of animal behavior, and will 
be found to apply at various evolutionary levels; experi- 
mental, in that these principles will be derived, not from 
casual observation or untested opinion, but from laboratory 
studies in which the important factors are isolated and varied 
in such a manner as to permit scientific lawfulness to be dis- 
covered; and systematic, in that the interrelation of experi- 
mental facts will be one of our major concerns. 

The Subject Matter of Psychology 

Tentatively we may define psychology as the science of the 
behavior of organisms. Such a simple statement, however, is 
both incomplete and misleading. Psychologists are not, as a 
rule, equally interested in the behavior of all organisms. 
Their attention is usually focussed primarily upon the human 
being and a few of his near-relations in the animal kingdom 
for example, the rat, the cat, the dog, the monkey, and the 
chimpanzee. The comparative psychologist makes it his busi- 
ness to examine differences and similarities of behavior 
throughout the evolutionary scale. We, however, shall have 
little interest in the activities of animals markedly different 
from man, or in those infra-human activities which throw no 
light on human conduct. 
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Moreover, the psychologist studies behavior in its relation 
to environment. Behavior alone would hardly constitute the 
subject matter of a science. Imagine, for a moment, the sense- 
lessness of a motion-picture record of an organism's behavior 
from birth to death, with every indication of the world in 
which it lived carefully blotted out! Only when we begin to 
relate behavioral to environmental happenings does the possi- 
bility of a scientific psychology make its appearance. 

Stimulus and Response 

But we must go still further. Behavior and environment are 
unwieldy terms, too broad in their meaning to be very useful. 
As soon as we attempt to deal with either, we find ourselves 
asking What sort of behavior? What aspect of environment? 
This is but another way of saying that, whenever we try to 
describe either the behavior or the environment of an or- 
ganism, we are forced to break it down into parts. Analysis is 
essential to description, in our science as well as others. 

Through analysis, psychologists have arrived at the concepts 
of stimulus and response. A stimulus may be provision- 
ally defined as "a part, or change in a part, of the environ- 
ment/' and a response may be defined as "a part, or change 
in a part, of behavior/' We shall recognize, however, that 
a stimulus cannot be defined independently of a response. 
An environmental event becomes a stimulus by virtue of the 
fact that it is followed by a response. Activities of our muscles 
and glands (the so-called bodily effectors) make up our re- 
sponses. These terms provide the specificity we desire and 
make a study of the environment-behavior relation feasible 
by giving us things to observe and to measure. Stimuli and 
responses are the basic units of our descriptions and provide 
the starting point for a science of behavior. We would not 
go far astray in asserting that modern psychology itself is 
essentially a stimulus-response psychology. 
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The Reflex 

Physiologists have supplied us with a very useful word with 
which to designate any specific stimulus-response relation- 
ship. The word is reflex, and, in the chapters to follow, we 
shall employ it frequently. For example, we shall speak of 
the salivary reflex when we refer to the fact that food, in the 
mouth of a hungry man, quickly evokes a flow of saliva. We 
shall even extend the use of the term to denote responses for 
which related stimuli are not clearly observable. Thus, we 
shall give the name reflex to the response made by a child in 
clapping his hands, even when we have no way of knowing 
exactly what stimuli are responsible for the act. But more of 
this later. 

The Response Mechanism 

Between the stimulus and the response there are obviously 
bodily happenings. It is well recognized today that stimuli 
affect sense organs (called receptors) and that sense organs, 
when excited, cause nerve impulses to be transmitted over 
nerve pathways to the brain or spinal cord and thence to the 
muscles and glands (effectors). This sequence of events results 
in those responses which we, as psychologists, are interested in 
relating to stimuli. The structures involved the receptors, 
the nervous system, and the effectors make up what is com- 
monly known as the response mechanism. 

The detailed function of the response mechanism is of 
principal concern to the physiologist. He tells us that the 
receptors or sense organs are of three main types: 

1. Exteroceptors, small structures within the eye, the ear, 
or the skin, which are excited by stimuli external to the 
organism. 

2. Inter oceptors, tiny organs which lie mainly within the 
alimentary tract, and which are excited by internal stimuli. 

3. Proprioceptors, which are located within the muscles, 
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joints, and tendons, as well as the inner ear, and are excited, 
that is, stimulated, by movements of the body or its parts. 

In line with this distinction, stimuli themselves are often 
described as exteroceptive, interoceptive, or proprioceptive. 
This gives us greater specificity than is conveyed merely by 
the word environment. 

We are also told that the nervous system has subdivisions. 
One part of it, the somatic nervous system, serves primarily 
to transmit nerve impulses from the sense organs to those 
muscles of the body which are employed in walking, talk- 
ing, writing, and the like responses which are highly spe- 
cific and of great importance in making changes in our 
environment. The other part, the autonomic nervous system, 
is essentially an extension of the somatic nervous system, 
but serves mainly for the conduction of nerve impulses to 
the glands and the so-called "smooth" muscles of our bodies 
for example, the digestive glands and the muscles of the 
blood-vessel walls, the stomach, and the iris of the eye. The 
action of part of this autonomic system is extremely diffuse 
and is most conspicuously responsible for the widespread 
internal changes which occur in all of us when under strong 
emotion. 

The Reflex Arc and the Reflex 

Anatomists and physiologists have analyzed in detail the 
structure and function of the response mechanism. Their 
interest in filling the gap between stimulus and response has 
led them to the conception of a reflex arc, in order to dis- 
tinguish it from the observed relation of stimulus to response 
which we have called the reflex. The simplest conceivable 
chain of structures between our two end-terms, stimulus and 
response, is presumably one that includes (i) a receptor ele- 
ment (cells or cell groups in the sense-organ tissues), (2) a 
sensory or afferent nerve element (nerve cells or neurons) 
for conducting impulses to some nerve center in either the 
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brain or the spinal cord, (3) a motor or efferent nerve element 
for conduction from nerve center to effector, and (4) the 
effector itself, a muscular or glandular element (muscle or 
gland cells) that accounts for the final response. 

This chain or arc of structural elements receptor cells, 
sensory and motor nerve cells, and effector cells has some- 
times been called a reflex, and some theorists have maintained 
that it is the organic basis of the stimulus-response relation. 
We need not here examine the evidence for such an argument, 
which has often been disputed, but it is important that we 
make a clear distinction between the reflex as a relation actu- 
ally observed and the reflex as a hypothetical mechanism. 
Confusion will be avoided if we use reflex in the former case, 
and reflex arc in the latter. 

The newcomer to psychology will gain little from further 
discussion of physiological matters. It is true that psychologists 
have long been interested in relating behavior to receptor, 
effector, and nervous-system function; and physiological psy- 
chology is today a thriving field of study for the advanced 
student. But as far as general principles of behavior are con- 
cerned, this area of investigation has not as yet been very 
helpful. It is, in fact, the case that our knowledge of the 
stimulus-response relation has more often preceded than fol- 
lowed our knowledge of its physiological counterpart. 

Extension of the Reflex Concept 

The reflex, as well as the reflex-arc, concept grew up in 
physiological science. Muscular movement, originally attrib- 
uted to supernatural forces and later ascribed to a power 
residing within the muscles themselves, was gradually recog- 
nized as due to the action of successively excited parts of the 
response mechanism. Also, the importance of an initiating 
stimulus was demonstrated, and it became clear that many 
animal reactions were a direct and almost inevitable result 
of stimulus presentation. Observation of such phenomena as 
the writhings of a decapitated lizard in response to mildly 
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injurious stimulation of its skin led to the search for stimuli 
to other reactions of operated animals, and soon disclosed 
a considerable number of comparable stimulus-response rela- 
tions. The advocates of scientific determinism were thus en- 
couraged to extend their researches into the realm of normal, 
intact animals and, eventually, to man himself. It became 
obvious that, regardless of the physiological activities in- 
volved, stimuli and responses were often associated in a defi- 
nite and openly observable cause-effect sequence. Lists of 
animal and human reflexes were compiled, and psychologists, 
as well as physiologists, turned to the task of further explora- 
tion and study. 

By 1919, it was possible for John B. Watson, an American 
pioneer in the objective, natural-science approach to psychol- 
ogy, to suggest that the identification, enumeration, and classi- 
fication of stimulus-response relations was the principal 
concern of our science. In effect, he proposed a cataloguing 
of reflexes as basic to the prediction and control of human 
and infra-human behavior. In the furtherance of such a 
program, Watson himself made extensive observations on 
the 'unlearned' reactions of newly-born human infants to 
various simple forms of stimulation. 

Watson's proposal is now deemed impracticable. Even if we 
assumed that the entire reflex repertory of a given organism 
might some day be determined (which is very unlikely in 
view of the effects of training and the extreme difficulty in 
identifying the stimuli for many common forms of response), 
we would still be unable to formulate general principles of 
the sort that we require. We need a dynamic, rather than a 
static, picture of the behavior of organisms. To describe proc- 
ess, not to inventory elements, is our major concern. A more 
productive approach has been one that takes for granted the 
basically reflex character of our behavior, selects a few ex- 
amples for observation, and examines their dynamic prop- 
erties in some detail. This is the approach that will be taken 
in the present text. 
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Some Reflex Properties 

We begin our account with a few facts that are so simple 
and ubiquitous that they might easily be overlooked. Yet they 
actually have the status of fundamental principles. They are 
readily demonstrable in any experimental situation where 
the intensity of a stimulus can be controlled and the magni- 
tude of a response can be measured, and they may also be 
crudely observed under the non-quantitative circumstances of 
everyday life. 

Take, as an example, the scratch reflex of a spinal dog in 
the laboratory of a physiologist. In such an animal, a severed 
spinal cord permits movements of the hind legs which are 
free from any influence stimulating the fore part of the body. 
When a touch stimulus is applied to the dog's hind ribs, a 
rhythmic flexion and extension of the right hind leg will 
take place. This response resembles a normal dog's reaction 
to the bite of a flea, but since it is uncomplicated by the effect 
of any stimuli concurrently exciting the dog's eyes or ears, 
it has a somewhat artificial appearance. Nevertheless, it is 
easily elicited and very useful in illustrating properties of 
reflex action in general. 

The Threshold or Limen 

One of the first facts that may be demonstrated with such 
a reduced animal is that a stimulus must be of a certain in- 
tensity before it can elicit a response. The intensity just suffi- 
cient to excite is called a liminal or threshold intensity. Lesser 
intensities are called below-threshold or sub-liminal since, 
when applied singly, they never evoke a reaction. Correspond- 
ingly, greater intensities are called above-threshold or supra- 
liminal. 

All of the elicited responses of organisms (either operated 
or intact) show this dependence upon stimulus level, and 
great effort has been expended throughout the past century 
in determining the liminal intensities of visual, tactual, 
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auditory, and other stimuli. Indeed, an entire field of re- 
search, psychophysics, has been oriented about such measure- 
ments. Workers in this field have dealt not only with absolute 
thresholds, the kind defined above, but also with difference 
thresholds, in which one studies the organism's capacity to 
detect differences in the intensity of stimuli that are at the 
outset well above absolute-threshold value. 

We shall come back to thresholds later, in Chapter 5. At 
present, it is enough to note that many common observations 
point to their existence. When we raise a watch to our ear 
to hear the tick, when we find the first star in the evening sky, 
or when we observe the dimming of theatre lights, we are 
dealing with the fact of absolute or relative intensity limens. 
Other instances will quickly occur to anyone who thinks 
about the matter. 

Latency 

A short interval of time elapses between the application of 
a stimulus and the arousal of a response. This interval is 
called the latency or latent period. In the scratch reflex of 
the spinal dog, the interval may range from 140 ms. (milli- 
seconds ) to 500 ms. (half a second) , depending upon the in- 
tensity of the stimulus employed. In such a reflex, strong 
stimuli reduce the latent period and weak ones lengthen it. 
Other reflexes may show a different range of latency values, 
but for any single reflex of this sort conditions may be so con- 
trolled as to produce a fairly constant value. This constancy 
is important since it gives us a measure of the effect of chang- 
ing the conditions. 

Although this relation is quite dependable in such cases 
as that of the spinal dog's scratch reflex, there is another type 
of stimulus-response connection where changes in stimulus 
intensity are not so clearly reflected in alterations of latency. 
When the driver of a car responds to a green light by making 
the movements which set his car in motion, the latency of 
the response does not appear to be directly related to the 
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intensity of the light. He starts up as quickly for a weak light 
as for a strong one. The process involved in such a case 
requires, as we shall see, its own explanatory treatment. 

We shall return to the matter of latency in Chapter 5, when 
we discuss some experimental studies of reaction time a. term 
that may already have suggested itself to you. We shall then 
see that latency and reaction time are not unrelated, the 
latter being a variation of the former. For the present, how- 
ever, the distinction should not be prematurely pressed. 

Stimulus Intensity and Response Magnitude 

If the scratch reflex of our spinal dog is elicited by a barely 
supra-liminal stimulus, the latency, as stated above, will be 
relatively long. This will not be the only effect observed. 
The flexion of the leg may be limited to a single short excur- 
sion or, at most, to two or three. If, now, the stimulus in- 
tensity is raised to a higher value, there will ensue an increase 
in both the amplitude and number of scratching movements. 
Within limits, the magnitude of the elicited response is de- 
pendent upon, that is, is a function of, the intensity of the 
stimulus. 

As in the case of latency, however, the response magnitude 
is not always seen to bear such a neat relation to the stimulus 
variable. In a snake-infested cave, a faintly heard rattle may 
evoke as vigorous a reaction as the report of a revolver; in a 
tropical jungle, the hum of a mosquito may sometimes elicit 
as strong an emotional response as the roar of a near-by plane. 
Such observations point to other controlling factors, and the 
laboratory bears this out. A different, but nonetheless lawful, 
relation will occupy our attention later. 

Reflex Strength 

There are properties of reflex action other than those just 
described, and they too may be observed in the behavior of 
a spinal animal. We need not, however, consider them in this 
text, since we are here concerned with bare essentials, and 
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FIG. i. Schematic tape records of three reflex properties. The height 
of a spike on the stimulus line indicates stimulus intensity; that on the 
response line, response magnitude. The distance between the stimulus 
and response spikes on the constant-speed tape gives the latency. The 
time line is made by an automatic marker which ticks off any desired 
unit of time. The broken threshold tape shows that the stimulations 
were very widely spaced to prevent possible summation of stimulus 
effects. 
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we already have enough facts to take the next step in our 
thinking. 

Returning to the scratch reflex again, let us ask ourselves 
a question. What happens to threshold, latency, and response 
magnitude when we apply a stimulus not once but many 
times and in fairly quick succession? A moment's thought may 
suggest the answer. The reflex undergoes fatigue. This shows 
itself on the observational level. The latency increases, the 
magnitude of successive 'scratchings' becomes less and less, 
and the stimulus intensity required for threshold excitation 
heightens appreciably. Moreover, these changes take place 
together. 

Since other procedures or "experimental operations" than 
that of repeating stimulations will also produce concurrent 
changes in reflex properties, it will be useful to have a single 
term to denote such changes. Reflex strength has been sug- 
gested for this purpose and will be adopted here. We shall, 
then, speak of a strong reflex when there is a relatively low 
threshold of stimulation, a relatively short latency, and a 
relatively large magnitude of response. Conversely, a weak 
reflex will be considered as one with a high threshold, a long 
latency, and a small magnitude of response. Frequently, we 
shall take as our indicator or measure of strength either the 
latency or the response magnitude alone, without bothering 
to consider the related values of other properties. 

It has already been mentioned, in passing, that the term 
reflex will often be applied in this book to responses which are 
not observably related to any stimuli. In such cases, where 
concepts like latency and threshold can have no meaning 
(since they depend upon a stimulus-response relation), we 
shall nevertheless speak of reflex strength. Frequency of 
occurrence will then become a very important measure of 
strength. Until we come to Chapter 3, however, this matter 
needs no further discussion. 
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A Few Remarks 

So much by way of preliminaries. This is a short chapter, 
but you should now be able to say something about (i) the 
general aim of this text; (2) the subject matter of psychology; 
(3) the concept of the reflex; (4) the nature of a few basic 
reflex properties; and (5) the meaning of reflex strength. All 
this is essential to your understanding of what will be treated 
in the pages to come. Each step in our exposition will depend 
upon what has gone before. Unless you have understood the 
earlier facts and principles, you will almost surely have 
trouble with the later. For this reason it will often be helpful 
for you to go over again a paragraph, a section, or a chapter 
previously read, in order to strengthen your foundation for 
the topic at hand or the topics to come. 

It will also be wise, in reading the coming chapters, to 
divest yourself, as far as possible, from preconceptions con- 
cerning psychology. Our everyday language is shot through 
with purportedly psychological terms and concepts. Most of 
these are lacking in scientific significance, because they are 
either poorly defined or unrelated to anything else, and they 
will find no place in our account. You are asked here, as you 
would be in physics, chemistry, or biology, to learn a new 
language. The power of this language can be appreciated 
only by the one who applies it consistently and rigorously, 
avoiding the contamination of ill-defined or long-discarded 
terms. 

NOTES 

In the back of this book, along with the subject index, is an alphabet- 
ical list of some of the more important books and articles to which we 
have referred in preparing our chapters. We have adopted the practice 
of mentioning, in the text, authors' names and the dates of their publica- 
tions (e.g., Hilgard and Marquis, 1940). The complete reference is 
provided in our list thus "HILGARD, E. R., and MARQUIS, D. G. (1940). 
Conditioning and learning. New York, Appleton-Century-Crofts, xi, 429 
pp." After each reference you will find, in italics, the page numbers on 
which the contribution is cited in this text. 
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The introduction to psychology provided by this book is something of 
a departure from the usual approach, and you may at some time be 
interested in comparing it with other texts. Three very popular elemen- 
tary accounts are those by Munn (1946), Ruch (1948), and Woodworth 
and Marquis (1947). Two other well-known books, by Garrett (1941) 
and by Crafts, Schneirla, Robinson, and Gilbert (1938), are often used 
as collateral reading in beginners* courses. Any one of these will give 
you a good idea of the range of topics to which students are usually 
exposed in their first course, but in none of them will you find a serious 
attempt to show how the topics are interrelated or integrated within a 
unified whole. Only a few authors have made an effort similar to ours 
(e.g., Muenzinger, 1942). 

The systematic position portrayed in the present text is best described 
as reinforcement theory t which is the dominant viewpoint in modern 
behavior theory. Our own account leans heaviest on the work of B. F. 
Skinner (The behavior of organisms, 1938), but is not uninfluenced by 
other expositions as found, for example, in the writings of E. L. Thorn- 
dike (1911, 1932) and the recent teachings of C. L. Hull (1943). 

The student who does not mind reading something of a more tech- 
nical stripe than we offer may go to Skinner's book. There, in the first 
two chapters, he will find further discussion of such topics as the data of 
psychology, the need for analysis, the concept of the reflex, the structure 
of psychological theory, and related matters. The viewpoint in Hull's 
Principles of behavior (1943) provides a stimulating contrast. 

Our systematic position has not kept us from looking at facts in other 
theoretical contexts. Good data are good data, regardless of theory, and 
we have been happy to draw upon the experimental findings of our 
colleagues whenever it seemed profitable, regardless of the viewpoint 
that guided their research. Therefore, if you are one who likes to "go to 
the original," do not be surprised if you occasionally find an article 
written in terms with which you are unfamiliar. If you can push aside 
the interpretations and discussions, and attend to the experimental 
procedures and results, you will see why we thought the article worth 
consideration. Later on, if your interest in psychology continues, you 
will be better able to evaluate the explanations and debated points. 
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TAKE A hungry horse; expose the duct of the parotid gland on the side 
of the jaw; nothing will come out the gland is at rest. Now, show the 
horse some hay, or better, make some movement indicating that his meal 
is coming. Immediately a steady flow of saliva will start . . . 

Claude Bernard. La Science Exptrimentale, 1878 

Pavlov and the Conditioned Reflex 

In the history of science, it often happens that the facts, 
principles, or methods of one field are put to account in the 
development of another. This is especially true of a young 
science. In psychology, the boundaries of which have only 
recently been established, borrowings from other fields have 
been numerous. Since 1879, when Wilhelm Wundt set up 
the first psychological laboratory in Leipzig, Germany, our 
science has often drawn upon her elder sister, physiology. An 
example of this was cited in Chapter i: the concept of the 
reflex actually emerged as a result of purely physiological 
interest in the functioning of organisms. 

We must now acknowledge another debt, again to physi- 
ology, for one of our keystone concepts. This is the principle 
of the conditioned reflex, first clearly stated by the Russian 
physiologist, Ivan Petrovich Pavlov (1849-1936), as the out- 
come of investigations begun in Petrograd (now Leningrad) 
during the closing years of the last century. 

In 1904, Pavlov received the Nobel Prize in medicine for 
his studies of the digestive activity of dogs. During these 
studies, he noticed something that suggested to him an experi- 
mental solution of some of the problems of brain function. 
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Drawing about him a number of co-workers, he soon launched 
a large-scale program of research a program that took up 
the remaining years of his life and won him grateful recog- 
nition from biological scientists throughout the world. Psy- 
chologists, however, have profited more from this research 
through the light it shed upon behavior than through the 
speculations Pavlov advanced concerning brain function. 

Pavlov's basic observations were simple. If food or certain 
dilute acids are put in the mouth of a hungry dog, a flow of 
saliva from the appropriate glands will soon begin. This is 
the salivary reflex, long known to exist in various animals, 
including man. But this is not all. Pavlov noted, like others 
before him, that the animal would also salivate when food 
had not yet reached the mouth: food seen or food smelled 
would elicit the same response. Also, the dog would salivate 
merely upon the appearance of the man who usually brought 
his food. 

For Pavlov, these observations raised important experi- 
mental questions. How did it happen that the mere sight of 
the person who fed the dog was enough to evoke a salivary 
secretion? Surely, this was not an innate or inborn stimulus- 
response relation, typical of all dogs and as uneducable as 
the scratch reflex of a spinal animal. On the contrary, it 
seemed obvious that the effect of such a pre-food stimulus 
could be understood only in terms of the individual experi- 
ence of the organism. Somehow, an originally ineffective 
stimulus for salivary response must have taken on new signifi- 
cance for this animal; it must have come to signalize the 
approach of food. Also, it seemed to prepare the animal for 
the food by starting the digestive process. 

This led Pavlov to develop an experimental method for 
studying the acquisition of new stimulus-response connec- 
tions. In practice, this method requires no small degree of 
laboratory control and technical skill, but it may be outlined 
rather simply. First, a normal dog is familiarized with the 
experimental situation until he shows no disturbance when 
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placed in a harness and left alone in a room especially de- 
signed to cut off unwanted outside stimuli. A small opening 
or fistula is made in the dog's cheek near the duct of one of 
the salivary glands. When the fistula is healed, a glass funnel 
is carefully cemented to the outside of the cheek so that it 
will draw off the saliva whenever the gland is activated. From 
the funnel, the saliva then flows into a glass container or falls, 
drop by drop, upon a lightly balanced recording platform. 
The magnitude of responses to various stimuli can be meas- 
ured by the total volume or the number of drops secreted in 
a given unit of time. The experimenter, who sits in an ad- 
joining room, can make his measurements, apply what stimuli 
he desires (including food), and observe the dog's behavior 
through a window. 




FIG. 2. Representation of a Pavlovian situation for conditioning the 
salivary response in a dog. (After Dashiell, 1949.) 

When everything is ready, the dog is exposed, on successive 
occasions, to a pair of stimuli. One stimulus, say a small por- 
tion of powdered food, initially elicits a flow of saliva each 
time that it appears and the dog eats. The other, say a tone, 
has no such effect, but may cause some other behavior, perhaps 
a twitching of the ears or a turning of the head toward the 
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source of sound. The combination of the two stimuli is pre- 
icnted at irregular intervals over a period of days, always at 
a time when the dog is hungry. The purpose, of course, is 
to determine whether one stimulus (the tone) will acquire 
the power of eliciting the same response as the other (the 
food). So, after a certain number of pairings, the originally 
ineffective stimulus (tone) is presented alone, to see if it will 
produce salivation. 

Table I gives data from an experiment by Anrep (1920), 
one of Pavlov's pupils; and it shows what happens when such 
an experiment is carried out. In this study, a tone of 637.5 

Table I 

ACQUISITION OF A CONDITIONED SALIVARY REFLEX 
(Anrep, 19*0) 

Number of Paired Response Magnitude Response Latency 

Stimulations (Drops of Saliva) (Seconds) 

i o 

10 6 18 

20 20 9 

30 60 s 

40 62 i 

50 59 

cycles per second was sounded for a five-second stimulation 
period; two or three seconds later the dog was given biscuit 
powder. At intervals of five to thirty-five minutes, this pairing 
was repeated. In sixteen days, fifty such combinations were 
presented and six tests were made with tone alone. The test 
tone was of thirty seconds' duration, and Anrep measured 
response magnitude by the number of drops of saliva that 
were secreted in this period. In addition, he recorded the 
latencies of the response, in seconds. 

From this table you can see that the amount of salivation 
in response to tone-alone increased from a zero value, after 
a single combination, to sixty drops in the test interval follow- 
ing the thirtieth combination. Along with this increase in 
response magnitude, there was a decrease in the latency of 
the response to tone, from eighteen to two seconds. Little 
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change occurred as a result of further pairings of tone with 
food, showing that the tone-salivation tie-up was well estab- 
lished by the thirtieth combination. 

Experiments like this led Pavlov to formulate a new prin* 
cipie: 

If any casual stimuli happen, once or a few times, to accompany 
stimuli which elicit definite inborn reflexes, the former stimuli begin 

of themselves to produce the effect of these inborn reflexes We 

designate the two sorts of reflexes, and the stimuli by which they arc 
elicited, as unconditioned (inborn) and conditioned (acquired) re- 
spectively. (Pavlov, 19*3, translated by E. B. Holt in Animal drive and 
the learning process f 1931, p. 24) . 

A schematic picture or paradigm of Pavlovian "condition- 
ing" may be helpful at this point. 

s (food) " R < sallvatlon > 

r (ear-twitch Ing) 

In this paradigm, three reflexes are represented. The food* 
salivation and the tone-ear-twitching reflexes are "uncondi- 
tioned"; the tone-salivation reflex is "conditioned." The 
letters S and R refer, of course, to stimulus and response. The 
use of r is merely to show that the ear-pricking response to 
tone is of no great importance in this conditioning process; 
it may even disappear during the repeated application of 
the tonal stimulus. The important response, and the one 
that is measured, is the one belonging to the salivary reflex. 
Early in their studies, Pavlov and his students found that 
this sort of conditioning could occur only when the food- 
salivation reflex was stronger than the reflex elicited by the 
"casual" stimulus. For example, an intense electric shock 
(rather than a tone, a light, or a touch) would not become a 
conditioned stimulus for salivation because it produced a 
violent emotional upset in the animal. This led Pavlov to say 
that a conditioned reflex must always be based upon an 
unconditioned reflex that was "biologically more important" 
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or "physiologically stronger." The stronger of the two uncon- 
ditioned reflexes is the one that strengthens or reinforces 
the new stimulus-response relation. The stimulus of the 
stronger unconditioned reflex is often called the 4 'reinforc- 
ing stimulus." 

Pavlov's principle has been restated, by Skinner (1938), in 
a way that highlights the importance of the reinforcing stim- 
ulus, and points up the fact that a new reflex is formed by 
combining elements of the two that were already present in 
the organism's repertory. 

The approximately simultaneous presentation of two stimuli, one 
of which (the "reinforcing" stimulus) belongs to a reflex existing at 
the moment at some strength, may produce an increase in the strength 
of a third reflex composed of the response of the reinforcing reflex 
and the other stimulus. (Skinner, The behavior of organisms, 1938, 
p. 18.) 

Temporal Factors in Conditioning 

In the above statement, as in Pavlov's, a close relation of the 
two stimuli in time is specified. One stimulus is to "accom- 
pany" or be "approximately simultaneous with" the other. 
We are tempted to ask further questions about this relation. 
Does conditioning proceed more rapidly with simultaneous 
than with successive stimulus presentations? If successive 
stimulation is effective, which of the two stimuli should come 
first for best results? Is conditioning still possible when con- 
siderable time elapses between the two? 

Answers to these questions have been sought by several 
investigators and we now know that a strict simultaneity of 
the two stimuli is unnecessary for the rapid development of a 
conditioned reflex; and that a close succession of stimuli, one 
being presented two or three seconds after the other, is prob- 
ably the most effective arrangement of all. We know, too, 
that a conditioned reflex is set up only with very great diffi- 
culty, if at all, when the conditioned stimulus follows the 
unconditioned, even by a fraction of a second. In terms of 
our tone-food example, the tone should precede the food 
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(as it did in Anrep's experiment) if the conditioning proce- 
dure is to take effect. 

As to how far in advance of the unconditioned stimulus 
the other one may come, research does not yet give a final 
answer. What evidence we have makes it seem likely that a 
limit would soon be reached. Two types of Pavlovian proce- 
dure bear upon this problem. In one, the salivary response is 
first conditioned to a sound or some other stimulus by the 
method of "simultaneous" presentation. Then, as the pairing 
continues, the unconditioned stimulus is not provided until 
the conditioned stimulus has been steadily present for a given 
period of time, say three minutes. Eventually, under such 
circumstances, a "delayed" conditioned reflex may be estab- 
lished: the animal will respond with salivation only after the 
conditioned stimulus has been present for two minutes or 
more of the three-minute interval. One is led to say that he 
can now "tell the time" with considerable accuracy. 

The second type of procedure is similar to the first, but 
with one important difference: the conditioned stimulus is 
not maintained continuously during the interval of delay, but 
is presented only at the beginning of the interval. As in the 
case of the delayed reflex, however, long-continued pairings 
of this sort will bring about a temporal discrimination: the 
dog will not salivate until the time for reinforcement ap- 
proaches. Pavlov called this a "trace" conditioned reflex, 
arguing that the immediate cause of salivation was some trace 
of the conditioned stimulus that had been left in the nervous 
system of the animal. 

Related to these two procedures, because of the time dis- 
crimination shown, are the following observations, also made 
in Pavlov's laboratory, (i) A dog was fed regularly at thirty- 
minute intervals. When this routine had been well estab- 
lished, food was withheld at the usual feeding time. Salivation 
was nevertheless noted to occur at approximately the end of 
the thirty-minute period the time when the food would 
ordinarily have been provided. In Pavlov's terms, a "time 
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reflex" was formed. (2) In another case, the same thirty- 
minute interval between feedings was used, but the food was 
always presented to the accompaniment of a metronome beat. 
After repeated pairings of metronome and food, salivation 
was conditioned to the sound, as you would expect, but it 
was also dependent upon the time-since-feeding. If the metro- 
nome was sounded alone early in the period between feedings, 
no salivation would occur; if it came slightly later, a small 
magnitude of response might be produced; and, as the end 
of the period approached, the effect would be correspondingly 
greater. Finally, with long training, salivation-to-metronome 
was elicitable only at the very end of the between-feeding 
interval; the response was conditioned, so to speak, to metro- 
nome-plus-thirty-minutes. 

These rather astonishing results tell us that the dog can 
make an extremely delicate time discrimination, but they do 
not bear upon the question of the maximal possible delay 
between the conditioned and the unconditioned stimulus. 
The experiments on the delayed and the trace conditioned 
reflex are more to the point. The fact that a three-minute lapse 
between stimuli results in conditioning only after many pair- 
ings probably indicates that the limit was almost reached. 
Certainly, under the usual conditions of Pavlovian experi- 
mentation, in which the pairings of stimuli do not come at 
regular intervals, we would not expect to train a dog to 
salivate to a tone that preceded food by half a day! 

Compound Stimuli 

In Pavlovian conditioning, a relation is established between 
a response and some stimulus that accompanies the reinforc- 
ing stimulus. Why, then, you may ask, does the response link 
itself exclusively to the tone, the light, or the touch provided 
by the experimenter; are there not other stimuli in the 
situation which regularly accompany the presentation of 
food? This is a simple enough question, but the answer 
is complex, having at least two major aspects. First, these 
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"other" stimuli may be present not only when reinforcement 
is given, but also under conditions of non-reinforcement, in 
which their eliciting power would be expected to dissipate 
itself (in accordance with the principle of extinction, to be 
treated in Chapter 4). Secondly, a number of experiments 
from Pavlov's laboratory have pointed to the fact that when 
certain compounds of stimuli, such as light-plus-sound or 
sound-plus-touch, are regularly paired with food, it is possible 
that only one member of the compound will become a con- 
ditioned stimulus. For example, Palladin conditioned saliva- 
tion to a combination of touch-plus-temperature. Then he 
tested separately the elicitive function of the two components. 
The tactual stimulus was found to elicit as strong a response 
as the compound, but the thermal stimulus was without the 
least effect (Pavlov, 1927). 

Such findings have opened up a brand-new field of research 
in conditioning, but one into which we need not enter here. 
We shall, however, return to the problem in another connec- 
tion, when we consider the stimulus control of another type 
of conditioned response (Chapter 8). 

The Extension of Pavlovian Research 

Pavlov and his collaborators studied many other aspects 
of salivary conditioning than the ones we have mentioned. 
Some of this research is of interest only to the specialist in 
this province, and we may ignore it here. We cannot, however, 
leave the basic principle without some general remarks about 
its extension, its significance, and the influence it has had 
upon psychological thought. 

We know today that the principle may be demonstrated 
in the behavior of many more animals than the dog. Hardly 
a species has been studied in which conditioning cannot be 
established. Even one-celled organisms seem to display sim- 
ilar changeability. Special experimental conditions may be 
needed. Thus rats, guinea pigs, and other small animals re- 
quire apparatus and techniques that are clearly unsuitable 



RESPONDENT CONDITIONING 25 

for human beings. But the broad generality of the principle 
is not to be questioned. 

Extension of the procedure has also involved the use of 
reflexes other than the salivary response to food (or acid) as 
the "biologically stronger/' reinforcing reflex. Since 1916, a 
number of investigators, mostly American, have shown that 
the constriction of the pupil of the human eye, which results 
naturally from stimulation with strong light, can be condi- 
tioned to the sound of a bell, or some other stimulus. Others 
have demonstrated that changes in the electrical resistance of 
the skin (through sweat secretion) , elicited by such stimuli 
as a mild electric shock or a fairly loud buzzer sound, may 
readily serve as a basis for new reflexes. Still others have 
worked with such reinforcing reflexes as blood-vessel constric- 
tion in response to stimulation with cold objects, changes in 
pulse beat resulting from electric shock or skin injuries, 
and so on. 

Many agents have been used as conditioned stimuli, within 
the sense-fields of sight, sound, touch, taste, and smell even 
proprioceptive stimuli, aroused by movements of the legs, 
hands, or arms, have been employed. In several experiments, 
responses have been conditioned to words, either spoken by 
the experimenter or by the subject. A couple of these experi- 
ments may be described briefly because of their intrinsic in- 
terest and their relation to the problem of 'controlling* bodily 
changes. 

Hudgins (1933) seems to have been the first to condition 
a response to self-initiated verbal stimuli. He used, as his 
basic unconditioned reflex, one with which Cason (1922) 
had already worked in human experimentation: the con- 
striction of the pupil of the eye in response to bright light. 
In a rather complicated sequence of stimulus pairings and 
combinings, he was able to condition this pupillary reflex 
to (i) the sound of a bell; (2) a vigorous contraction of the 
hand and fingers; (3) the word contract, as spoken by the 
experimenter; (4) contract, when spoken by the subject him- 
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self; (5) contract, when whispered by the subject; and, finally, 
(6) the subject's silent, or sub-vocal, contract. Which is to say 
that, through a conditioning procedure, the subject came to 
control his own pupillary contraction a feat that is ordinarily 
thought impossible for human beings. 
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NUMBER OF REINFORCEMENT REPETITIONS 

FIG. 4. An acquisition curve showing the magnitude of the condi- 
tioned galvanic skin response after varying numbers of reinforcements. 
The CS was a tone; the US for this respondent was an electric shock 
to the wrist. (After Hull. 1943, based on data of Hovland, 



Menzies (1937) conditioned changes in the skin tempera- 
ture of human subjects by a very ingenious technique. Un- 
conditioned stimulation was applied by immersing a sub- 
ject's hand in a beaker of ice-water, and the measured re- 
sponse was the elicited change in temperature of the sub- 
ject's other hand. (It had been known, since 1858, that a fall 
in the temperature of one hand is regularly accompanied by 
a similar change in the temperature of the other.) With this 
stimulation was paired, in various parts of Menzies' experi- 
ment: (a) the sound of an electric bell or a buzzer; (b) a 
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visual pattern of illuminated crosses; (c) verbal stimuli the 
meaningless word prochaska, spoken aloud by the experi- 
menter and repeated in a whisper by the subject, or merely 
whispered by the subject alone; and (d) the proprioceptive 
stimulation provided by extension of the arm, clenching of 
a fist, or holding the head in a thrown-back position. Condi- 
tioning was effectively established, in from nine to thirty-six 
pairings, for twelve of the fourteen subjects. (In one of the 
two 'failures/ conditioning was doubtful; in the other it 
did not take place, presumably because the unconditioned 
stimulus itself was not always effective.) It was set up to verbal 
stimuli as readily when the subject whispered the word to 
himself as when the whispering was combined with the ex* 
perimenter's spoken word. Moreover, in three subjects who 
had been conditioned to respond to the visual pattern, the 
temperature change could be induced by asking them to 
"recall" or "think about" the stimulus! In short, Menzies 
showed convincingly that a conditioned thermal change could 
be set up easily in his subjects with all of the stimuli that 
he tried, both exteroceptive and proprioceptive. 

Such experiments as these raise some important questions 
concerning the nature of "voluntary control," but this is not 
the place for their consideration. At this point, it is probably 
enough to say that the problem will not be solved on the 
basis of Pavlovian conditioning alone, since this type of con- 
ditioning fails to tell us how the controlling word (contract 
or any other) itself comes to be strengthened. 

Physiologists tell us that all the reflexes thus far mentioned 
are alike in one important respect: they depend upon auto- 
nomic nervous system function. They involve the action of 
glands and smooth muscles (e.g., the secretion of sweat and 
the contraction of blood-vessels). Since the action of such 
effectors is often associated with states of emotion (in "fear," 
the saliva dries up, the sweat pours out, the skin cools, the 
pupils of the eyes dilate, etc.), it will come as no surprise for 
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you to learn, in later chapters, that these states may be con- 
ditioned in Pavlovian fashion. 

Yet, a few reflexes do not require autonomic function and 
may apparently be reinforcing. Foot-withdrawal at the onset 
of electric shock is neither a smooth-muscle nor a glandular 
response, but it has been used as a basic unconditioned reflex 
by Russian and American workers in many experiments. 
Bechterev (1857-1927), one of Pavlov's contemporaries, was 
the first to work extensively with this response and, in this 
country, through the studies of Liddell and others, it has 
become a common form of conditioning procedure. 

The experiments of Liddell and his collaborators, using 
sheep as subjects, are of particular interest in showing the 
relative complexity of the behavior involved in such * 'motor" 
conditioning situations. A common technique in their studies 
is one in which the sound of a metronome is paired with an 
electric shock to the animal's left foreleg. At first, only the 
shock will elicit a flexion of the leg muscles, but, after a few 
stimulus combinations, the beating of the metronome is in 
itself sufficient to evoke the response. (Along with this effect 
there may also be measured a conditioned change in skin 
resistance and breathing rate.) This is, to all appearances, a 
simple and straightforward case of Pavlovian conditioning. 
It may be shown, however, that seemingly minor alterations 
in the experimental procedure are enough to produce dra- 
matic changes in the subject's behavior. In one experiment, 
when shock regularly followed a five-second series of metro- 
nome beats, an increase in the daily number of stimulus pair- 
ings from ten to twenty resulted in an entirely unpredicted 
change in the sheep's behavior. Formerly a steady and tract- 
able animal, he suddenly began to show distinctly "neurotic" 
symptoms. He resisted being led into the laboratory; once 
in the experimental harness, he became quiet, but only for as 
long as the experimenter remained in the room; when left 
alone, and before any stimuli had been applied, he began to 
make foreleg movements as if in expectation of the shock; 
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the effect of the shock, when actually given, was to quiet the 
animal for a minute or more but, as the time for the next 
stimulation approached, the frequency of the leg movements 
increased. In spite of a reinstatement of the earlier experi- 
mental conditions, this deviation from normal behavior be- 
came daily more pronounced, and was alleviated only by a 
long vacation in pasture (Anderson and Liddell, 1935). 

Observations of this sort, on other animals and over a 
period of years, have raised important questions concerning 
the origin, development, and cure of neurotic behavior- 
questions already raised by Pavlov in his studies of the dis- 
criminative capacities of dogs (see Chapter 5), and questions 
to which Liddell himself has given much attention (Liddell, 
1938). They also suggest the presence, in an apparently "pure" 
Pavlovian set-up, of factors which have not as yet been fully 
identified. Certainly the results of such experiments as Lid- 
dell's are strikingly at odds with those obtained in the con- 
ditioning of salivary and other autonomic functions. 

The latter suggestion is supported by several recent dem- 
onstrations of pseudo-conditioning, in which motor responses 
not unlike foot-withdrawal have been employed, and in which 
"conditioning" occurred without any pairing of stimuli. For 
example, Reinwald (1938) has observed that white rats, after 
jumping and running in response to a few electric shocks, 
will react similarly to a tone that was initially without ob- 
servable effect upon their behavior. Had this effect resulted 
from a succession of tone-shock combinations, it could easily 
have been mistaken for true conditioning. Certain strong un- 
conditioned stimuli may, apparently, be so generally disturb* 
ing as to render an organism sensitive to influences which, 
under other circumstances, would not have been felt. 

Indirect evidence of the complexity of supposedly simple 
extensions of Pavlov's technique to responses that involve 
action of the somatic, rather than autonomic, nervous system 
will be presented in the following chapter. You will see that 
it is possible, in some such instances, to point to the operation 
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of another basic principle of conditioning one that clearly 
applies to most of our everyday actions and is often found in 
combination with the one now under discussion. You will also 
be able to understand why one theorist (Skinner, 1938) has 
suggested that Pavlovian conditioning is limited exclusively 
to autonomic responses. 

Respondent Behavior 

Whatever the strengths or limitations of the Pavlovian prin- 
ciple, one point stands out clearly: this type of conditioning 
always depends upon the elicitation of response. Food elicits 
salivation; strong light elicits a pupillary constriction; shock 
elicits foot-withdrawal; and so on. The unconditioned stim- 
ulus is observable, and the basic reflex occurs with a regularity 
and automaticity comparable to the reaction of a spinal dog. 
Also, as with a spinal reflex, strength may be measured in 
terms of such properties as latency and response magnitude. 

The name respondent has been given to stimulus-elicited 
behavior in order to contrast it with behavior for which no 
stimuli can be identified. We have adopted this term and 
will use it in the succeeding pages of this book. By introducing 
it here, we justify the title of the present chapter and pave 
the way for later discussion. Since all the reflexes thus far 
mentioned involve the action of identifiable eliciting stimuli, 
we may use respondent as the equivalent of Pavlovian condi- 
tioning, and we may speak of a respondent when referring 
to a specific instance of such conditioned or unconditioned 
behavior. 

Higher-Order Conditioning 

It was reported from Pavlov's laboratory, early in the 
twenties of this century, that a conditioned reflex, once set 
up, might serve as the unconditioned-reflex basis of another; 
and a distinction was made between primary and secondary, 
or higher-order, conditioning. Frolov, one of Pavlov's co- 
workers, conditioned salivation to both the sound of a buzzer 



RESPONDENT CONDITIONING 91 

and the beat of a metronome. When these two first-ordel 
conditionings were well established, he used them in build- 
ing a second-order reflexsalivation in response to a visual 
stimulus, a black square. Great caution had to be exercised 
in presenting the stimuli: an interval of fifteen seconds had 
to elapse between the black square and the sound of the 
'reinforcing' metronome beat or no conditioning was possible. 
Also, the secondary reflex never became very strong: the 
latency was great and the response magnitude was small. But 
some effect was discernible, in spite of the fact that the black 
square was never paired directly with the original food 
stimulus. 

In another experiment, Foursikov used the foot-withdrawal 
response to electric shock as his basic reflex and was able to 
obtain results that pointed to the possibility of third-order 
conditioning. The withdrawal response was first conditioned 
to a tactual stimulus, then to the sound of bubbling water, 
and, finally, to a tone of 760 cycles per second, with each new 
reflex based exclusively upon the preceding one. This is 
schematized in the three paradigms shown on this page, 
where Roman numerals I, II, and III indicate the succes- 
sively conditioned reflexes. Again, however, the effect re- 
quired highly controlled experimental conditions, was rather 
unstable, and grew less as the order went higher. Also, pro- 
longed attempts by Foursikov to set up a fourth-order reflex 
were entirely without success. 

S (shock) ^ R (foot-withdrawal) 



8 (touch) 



6 (touch) - >-R (foot-withdrawal) 
II 

S (bubbling water) ' 



S (bubbling water) - >- R (foot-withdrawal) 
III "" 



3 (tone)- 
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It is possible that the facts do not clearly prove the existence 
of higher-order conditioning. Conceivably, the findings are 
due to other factors in the situation than stimulus combina- 
tion. In Foursikov's study, one might point to the sensitizing 
effect of electric shock and the similarity of the sound of tone 
to that of bubbling water for at least some of the effects of 
the conditioning procedure. Be this as it may, the influence 
of higher-order conditioning could hardly be expected to play 
much of a part in the everyday behavior of organisms, where 
conditions are seldom well controlled. 

Pavlov's Place in Psychology 

The principle of respondent conditioning, firmly estab- 
lished on an experimental footing, had many repercussions 
in psychology. It appealed especially to the objectivists in the 
field as a welcome replacement for the older, subjective "asso- 
ciation of ideas*' a legacy from British philosophy. Men like 
John B. Watson saw in the concept at least a partial explana- 
tion of the fact that many stimulus-response relations, not 
discoverable in infancy, are present in adult life. Ignoring the 
problem which this raised for anyone who sought to identify, 
in any adult, all the stimuli for his responses, they seized upon 
the principle to show that everyone's behavior repertory is 
the final product of countless stimulus substitutions. Over- 
whelmed by the vision of a natural-science explanation of 
behavior that had previously been attributed to 'psychic* or 
'mental' influence, they forgot for a time that they were at 
the beginning, rather than the end of their labors. 

The apparent demonstration of higher-order conditioning 
gave added impetus to this movement. Overlooking the diffi- 
culties involved in such a demonstration, they accepted the 
experimental findings with alacrity as evidence of the all- 
embracing power of Pavlov's formulation. If the mere com- 
bination of stimuli, even if remote from the one that was 
initially reinforcing, sufficed to set up new stimulus-response 
connections, the very citadel of subjectivity the "higher 
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mental processes" of imagination and thought might soon be 
stormed. 

Pavlov himself, although not unaware of the behavioral 
implications of his work, was more interested in the light he 
thought it shed upon the functions of the brain. Condition- 
ing, for him, depended upon the rigorous control of experi- 
mental variables time of stimulus presentation, number of 
reinforcements, strength of the basic reflex, and other factors 
all of 'which were to be studied in detail by laboratory 
methods. Wherever he looked, he saw problems, the analysis 
of which required research, and more research. On the other 
hand, his most ardent psychological admirers saw only solu- 
tions, answers to age-old questions. When these early enthusi* 
asts recognized any scientific problem, it was merely the old 
one of identifying the stimulus components of every environ- 
mental situation and describing the responses associated there- 
with. And such a problem does not readily give way to 
experimental attack. 

Nowadays we view the matter in a different way. Modern 
psychologists, although less interested in the physiological 
implications of their studies in this field, tend to lean in 
Pavlov's direction. That is, they have wholeheartedly adopted 
his experimental attitude and in general are wary of extend- 
ing the principle into territory not already cleared by labora- 
tory research. Gradually, they have taught us to see the limita- 
tions as well as the strength of Pavlov's work. 

Respondent conditioning is now a well-accepted principle 
of behavior. Pavlov would deserve a place in the history of 
psychology, if for no other reason. Fortunately for us, his 
work did not stop at this point. When we consider, in the 
chapters to come, such concepts as those of "extinction," 
"generalization," and "discrimination," we shall again have 
occasion to pay homage to this Russian genius. He did not 
give us a complete system of behavior. In fact, we shall see 
that other, non-Pavlovian, principles have actually become 
more important in the development of such a system. But he 
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carried us a great step forward in the path we were destined 
to follow in the scientific study of animal and human conduct. 
In retrospect, it is interesting to consider that a physiologist 
should have been the man to do so much in promoting our 
enterprise. We are in no position to weigh his contributions 
within his chosen field; we can say very little about the degree 
to which he cleared up the mystery of brain action; but his 
work will stand for many generations as a landmark in the 
analysis of behavior. 

NOTES 

The best single source-book for English-speaking students of re- 
spondent conditioning is Pavlov's Conditioned reflexes: an investiga- 
tion of the physiological activity of the cerebral cortex, a translation 
from the Russian by G. V. Anrep, published in London by the Oxford 
University Press in 1927. Another book, a collection of Pavlov's Lec- 
tures on Conditioned Reflexes, translated by W. H. Gantt, was pub- 
lished in New York by International Publishers in 1928. Besides the 
lectures themselves, this volume includes a short biography of Pavlov 
(by Gantt) and a bibliography of nearly two hundred different articles 
which emanated from Pavlov's laboratory between 1903 and 1926. 
Before 1927, however, psychologists in this country had a very incom- 
plete picture of Pavlov's work, and his influence was felt only gradually. 

It has been pointed out recently, by Hilgard and Marquis (1940), 
that the work of Vladimir M. Bechterev actually aroused more in- 
terest in this country than did that of Pavlov himself. Bechterev, as 
mentioned in your chapter, dealt with responses like foot-withdrawal 
to electric shock. Since his publications were often in German, and 
occasionally in French, they were more accessible to American readers. 
Having been a student at Leipzig under Wundt, he retained a strong 
interest in psychology and an acquaintance with its problems; and 
he used human as well as animal subjects in his experiments. His most 
important book, Objective psychology, was translated into French and 
German in 1913, and his teachings are now available in English 
(General principles of human reflexology, 1932). Since 1927, however, 
with the appearance in English of Pavlov's monumental work. Bech- 
terev's influence has gradually disappeared. Modifications of his basic 
technique are still used in many laboratories in the United States 
(at Yale, Cornell, Rochester, and Indiana, to mention but a few), but 
researchers have quite generally adopted the terminology and syste- 
matic concepts of Pavlov. 

Among the names associated with the early development of interest 
in Pavlov (and Bechterev) in this country are those of R. M. Yerkes 
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and S. Morgulis (1909), J. B. Watson (1916), K. S. Lashley (1916), 
W. H. Burnham (1917), F. Mateer (1918), S. Smith and E. R. Guthrie 
(1921), H. Cason (1922), and F. H. Allport (1924). You will find an 
excellent review of this development, together with a summary of some 
important Pavlovian concepts, in the first two chapters of Conditioning 
and learning (Hilgard and Marquis, 1940). 

A good review of early studies in stimulus compounding (in Russia 
and elsewhere) is available in a paper by G, H. S. Razran (19390). This 
is one of many reviews by Razran, whose scholarly interests and ac- 
quaintance with the Russian language have permitted him to render 
invaluable service to his American colleagues. Except for the trans- 
lations of Pavlov's books, our principal contact with Russian research 
lias been made through Razran's efforts. 
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SUPPOSE, now, that in putting out its head to seize prey scarcely within 
reach, a creature has repeatedly failed. Suppose that along with the 
group of motor actions approximately adapted to seize prey at this 
distance, ... a slight forward movement of the body [is caused on some 
occasion]. Success will occur instead of failure. . . . On recurrence of 
the circumstances, these muscular movements that were followed by 
success are likely to be repeated: what was at first an accidental com- 
bination of motions will now be a combination having considerable 
probability. 

Herbert Spencer, Principles of Psychology, 1855 



Thorndike and the Law of Effect 

In 1898, five years before the term conditioned reflex ap- 
peared in print, an important psychological monograph was 
published in the United States. Its title was ''Animal Intelli- 
gence: An Experimental Study of the Associative Processes 
in Animals/' and it was written as a doctoral dissertation by 
Edward Lee Thorndike (1874-1949) at Columbia University. 
This study was important for two main reasons: (i) it intro- 
duced an experimental method of investigating the problem- 
solving behavior of animals; and (2) it attempted to account 
for such behavior in terms of associations (stimulus-response 
connections) that were strengthened by their results. 

In Thorndike's experiments, cats, dogs, and chicks were 
subjects, but, in the present account, we shall limit ourselves 
to his studies with cats, which may be taken as typical of his 
work. With these animals, fifteen different forms of ' 'prob- 
lem box" were used as apparatus, representing as many dif- 
ferent problems for solution. Most of the boxes were crate- 
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like affairs, about twenty inches long, fifteen inches wide, and 
twelve inches high. At the top of each was a trap-door through 
which a cat might be dropped into the box, and on one side 
was a door through which he might escape and get to a small 
bit of food (meat or fish) on the floor outside. The door was 
held in a closed position by a bolt or a bar, but could be 
opened from within when some release-mechanisma latch, 
a wire loop, a lever, or some other simple device was properly 
manipulated by the cat. 

The experimental procedure for any given task was roughly 
as follows. A hungry cat was dropped into the box and left 
there until, in the course of his activity, he happened to 
operate the appropriate release-mechanismfor example, 
until he pulled a loop or depressed a lever that opened the 
door. As soon as he left the box and ate the morsel of food 
that awaited him outside, he was taken up by the experi- 
menter and put back in the box, the door of which had again 
been locked. After a second escape and feeding, the procedure 
was repeated; and so on. 

For each problem, Thorndike noted the time required by 
the animal to escape from the box on each successive trial. 
Figure 5 shows, graphically, the number of seconds needed 
by one cat on each of twenty-four trials to make a loop-pulling 
escape response. 

This curve, which is fairly representative of results ob- 
tained with other cats and other problems, helps us to under- 
stand what took place in these studies. First, we see an over- 
all reduction in the time per trial required for the animal to 
get out of the box. The number of seconds needed for the 
first escape was 160; for the twenty-fourth, it was only seven. 
The amount and the rapidity of the drop was greater for 
some problems and some animals than it was for others; and 
there was a wide variation in the number of trials required 
before the escape-time became minimal. Secondly, it appears 
that, in spite of the general decrease of time as the number of 
trials increased, there was considerable irregularity. Setbacks 
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in the animal's progress were fairly common. Thus escape 
from the box on the second trial required but thirty seconds, 
whereas, on trial three, which followed immediately, ninety 
seconds were needed. In some of Thorndike's other experi- 
ments, these irregularities were even more pronounced and 
continued for many trials before a consistent, quick solution 
developed. 
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FIG. 5. Time taken to escape by one of Thorndike's cats on successive 
trials in a problem box. (After Thorndike, 1898.) 

While gathering the time records, Thorndike did not fail 
to observe the behavior of his subjects. He noted, for example, 
that a cat, when first confronted with his problem, commonly 
made vigorous attempts to escape confinement. "It tries to 
squeeze through any opening; it claws and bites at the bars 
or wire; it thrusts its paws out through any opening and claws 
at everything it reaches; . . . For eight or ten minutes it will 
claw and bite and squeeze incessantly." (Psychol. Monogr., 
1898.) 

In the course of such activity, nearly all the cats hit upon 
the response that opened the door and gave access to food. 
When returned to the box for the second test, the struggling 
recommenced, and continued until a second solution was 
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achieved. Gradually (in some cases, rather suddenly), with 
successive confinements and escapes, the amount of useless 
activity diminished and the cat's behavior became clearly 
oriented toward the release-mechanism. Finally, a well-di- 
rected and stereotyped mode of response developed: the 
''problem" was solved. 

According to Thorndike, the solution of such a problem 
by cats and other animals involved the formation of an asso- 
ciation between some aspect of the stimulus-situation, such 
as the wire loop or the wooden lever, with the specific move- 
ment that led to door-opening. Further, he argued, the stimu- 
lus-response relation that finally appeared was obviously in- 
fluenced by the outcome of this movement. The pleasure 
experienced by the animal in getting out of the box and to 
the food served to stamp in the connection between stimulus 
and response that led to the pleasure. By the same token, 
stimulus-response connections that did not lead to a pleasur- 
able after-effect were not strengthened, and tended to drop 
out. 

Here was the first approximation to a basic principle of 
behavior. Thirteen years later, when Thorndike republished 
his monograph as part of a book on Animal intelligence 
(1911), the same idea was presented formally as the Law of 
Effect: 

Of several responses made to the same situation, those which are 
accompanied or closely followed by satisfaction to the animal will, 
other things being equal, be more firmly connected with the situation, 
so that, when it recurs, they will be more likely to recur; those which 
are accompanied or closely followed by discomfort to the animal will, 
other things being equal, have their connections with the situation 
weakened, so that, when it recurs, they will be less likely to occur. 
The greater the satisfaction or discomfort, the greater the strength- 
ening or weakening of the bond. (Thorndike, E. L., Animal intelligence: 
experimental studies. New York, Macmillan, 1911, p. 244.) 

By this time, Thorndike had joined the staff of Teachers 
College at Columbia University and had embarked upon a 
long career of productive research in psychology and educa- 
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tion. A man of strong practical interests, powerful motivation, 
and striking originality, his yearly output of work was enor- 
mous in bulk and varied in content. From his early experi- 
ments with animals, he was led to problems of human learn- 
ing, classroom procedures, dictionary construction, intelli- 
gence testing, and vocational guidance. Wherever he turned, 
he never lost sight of his fundamental principle and, from 
time to time, he brought forth specific evidence of its 
operation. 

The law of effect was not Thorndike's only answer to the 
way learning takes place in animals and human beings. Early 
in his thinking (1898), he recognized a law of exercise accord- 
ing to which connections are strengthened through mere 
repetition and weakened through disuse. By 1932, however, 
he was led by his own research to renounce his former posi- 
tion and to argue against exercise as a factor working inde- 
pendently of effect. Also, in 1913, he proposed a law of readi* 
ness, but this was little more than a guess, in terms of "conduc- 
tion units," as to the physiological conditions underlying the 
operation of his basic principle, and never played a very im- 
portant r&le in his research. Somewhat more interesting and 
more empirically grounded were five subsidiary laws which 
were intended to supplement the primary ones. One of these, 
that of associative shifting, is the Thorndikian counterpart 
of Pavlov's principle, and another (response by analogy) bears 
some resemblance to the principle of generalization with 
which we shall deal in Chapter 5. In the main, however, he 
left these subordinate laws in a relatively undeveloped state 
and we need not give them elaborate treatment here. 

The law of effect was, then, Thorndike's major contribu- 
tion, and the beginner in psychology generally accepts his 
formulation without question. "Trial and error, with acci- 
dental success" seems to describe satisfactorily a great deal of 
problem-solving behavior as he knows it in his everyday life. 
He is usually quite ready to believe that many of his own 
reactions to stimulus situations are firmly implanted because 
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of their effects. In fact, he may tell you that, long before he 
heard of Thorndike, he had assumed the operation of some 
such general principle and could have told you about it if 
you had asked him. How else, he may say, does one find his 
way about in a strange city, 'learn the ropes' of a new occu- 
pation, solve a Chinese puzzle, or master any complicated 
skill, except through the effect of the success that attends 
his trial-and-error behavior? 

You may, therefore, be surprised to hear that Thorndike's 
principle was challenged by psychologists in many quarters. 
How can it be, said one group of critics, that "pleasure" or 
"satisfaction," which are mental states, exercise an influence 
upon such clearly physical phenomena as responses to stimuli? 
How, said others, can the results of an action have any effect 
upon the action itself when the action is over and done with 
before the results are apparent what sort of cause-effect rela- 
tion is this, in which the effect has to function as the cause? 
Still others, less concerned with the philosophy or logic of 
Thorndike's position, argued that his formulation was lacking 
in generality and not always supported by fact. Observations 
were brought forward to show (i) that trial-and-error was 
typical only of a very restricted form of problem solvingone 
in which "insight" into the situation was prohibited by the 
very conditions of the experiment; (2) that, even with Thorn- 
dike's cats, the solution of a problem was not always "hit-or- 
miss" and gradually achieved, but was, at least in some cases, 
practically reached on the very first trial; and (3) that learn- 
ing is possible without any effect as when rats, given a chance 
to run a maze prior to the introduction of any food reward, 
showed by their speed of learning in later, rewarded runs that 
they had profited from their Unrewarded' explorations. 

In view of later developments, such arguments, and a great 
deal of the research that they fostered, were somewhat beside 
the point. They served to obscure, rather than clarify, a 
fundamental principle; to delay, rather than hasten, an im- 
portant line of investigation. A full appreciation of Thorn- 
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dike's contribution did not come until thirty-odd years after 
his first monograph was published, when the principle was 
re-affirmed and placed clearly within a larger body of theory. 

Skinner and Operant Conditioning 

In 1930, there was published, in the Proceedings of the 
National Academy of Science, a short paper on the eating 
behavior of white rats. The author was B. F. Skinner, then a 
graduate student in psychology at Harvard University. 
Skinner described, in his paper, an- experimental method that, 
in slightly modified form, has become a fixture in modern 
experimental research. It involved simply (i) a device for 
giving a small pellet of food to a hungry white rat each time 
that the animal pushed open the swinging door of a food-bin 
at one end of his experimental chamber; and (2) a recording 
mechanism that caused a vertical movement of a pen upon 
the paper-covered surface of a slowly revolving cylinder (a 
kymograph drum) whenever the rat opened the door of the 
bin to obtain the pellet. The vertical pen marks were made at 
right angles to the drum movement and were cumulative 
that is, each upward distance that the pen moved was added, 
by a ratchet device, to the preceding one. Since a short period 
of time was required for the eating of each pellet before the 
next door-opening took place, and since the drum continued 
to revolve steadily during this period, each mark was dis- 
placed slightly to the right of the preceding one. This pro- 
vided a step-wise record, of the sort shown in Figure 6. The 
vertical lines (of equal length) indicate successive door-open- 
ing responses and the horizontal lines (not necessarily equal 
in length) indicate the time elapsing between responses. Since 
the pellets were of a constant size and weight (about % 5 of a 
gram), an eating rate is represented. 

The step-wise effect is very obvious in Figure 6 because of 
the size of the time-units and response-units we have selected. 
If the units had been smaller, the effect would have been less 
pronounced. Figure 7 is copied from an actual record ob- 
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FIG. 6. Illustrating the construction of a cumulative record of bar- 
pressing responses. 
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Fie. 7. A cumulative record of a rat's food-getting responses (bar-press- 
ing for pellets) during a daily eating period. (After Skinner, 1938.) 
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tained in Skinner's experiment. In this case, the steps are so 
close together that they are imperceptible in our copy and a 
fairly smooth curve of eating results. 

Such a curve is characteristically obtained when a rat is fed 
daily with pellets at a regular time, and is deprived of food 
during the intervening periods. It shows that, under such a 
regimen, the animal begins by responding at a relatively high 
rate and gradually slows down as he nears the point of satia- 
tion. The curve is, to use the mathematician's term, nega- 
tively accelerated. 

The curve does not, of course, represent all eating behavior 
in rats. Had the animals been permitted to live in the experi- 
mental box and eat whenever their hunger led them to do so, 
they would probably have eaten at a somewhat slower but very 
constant rate on each occasion: straight-line, rather than 
negatively accelerated, curves would have resulted. The im- 
portant fact is the orderliness of the behavior that appears 
under a specified set of experimental conditions. The search 
for orderly relationships is characteristic of all science and, 
with the discovery of a single example, one is often led to 
hunt for more. 

The only behavior required of the rats in Skinner's experi- 
ment was the simple and fairly natural act of pushing open a 
door to reach food. How or when the animal learned to per- 
form this act was not determined, but a second experiment, 
reported by Skinner in 1932, dealt with this question. Using a 
modification of his earlier apparatus, he tried to find out the 
way in which a new act, one not previously related to food- 
getting, might come to be so related. The act chosen was that 
of pressing downward, with a force of about ten grams, a 
small lever. This lever, or bar, was situated at one end of a 
response chamber in a sound-resistant and light-proof experi- 
mental box (see Figure 8) . Its downward movement caused 
the ejection of a pellet of food, from a magazine in an adjoin- 
ing chamber, into a small metal cup or tray. With every 
activation of the food-magazine by the bar depression, a rec- 
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ord was made on a kymograph drum outside the experimental 
box. The record was cumulative, as in the study of eating rate. 
The experimental procedure involved (i) a preliminary 
acclimatization of the hungry animal to the response cham- 
ber, with a supply of food in the tray, until he moved and ate 
freely in the situation; (2) further sessions in which the rat 
was accustomed to eating pellets, when they were discharged, 
one at a time, from the food-magazine by the experimenter; 
and (3) training in the bar-pressing response. Stage 3, the 
important one in this experiment, was conducted as follows. 
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FIG. 8. An early model of Skinner's bar-pressing apparatus. (From 

Skinner, 1938.) 



After twenty-four hours' deprivation of food, the rat was 
placed in the response compartment of the box. The bar was 
present, the food-magazine was filled with pellets, and a water 
supply was obtainable from a small tube near the tray, but the 
tray itself was empty. When the rat approached the tray, as 
he had learned to do in stages i and 2, and found no food, he 
soon engaged in exploratory behavior of one sort or another 
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within the chamber. In ten or fifteen minutes, often sooner, 
this exploration led to a depression of the bar, usually accom- 
plished by one or both of the rat's forepaws as he raised him- 
self to sniff at the wall above the tray or at the slots through 
which the bar entered the chamber. The bar-depression was 
accompanied by the click of the food-magazine (to which the 
rat was accustomed in stage 2) and the ejection of a pellet into 
the tray. At the same time, the response was recorded on the 
kymograph drum outside the box. 
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FIG. 9. Some typical cumulative response curves obtained from hun- 
gry rats on the day of conditioning a bar-pressing response for the first 
time. Each response was reinforced with a pellet of food. Notice that 
conditioning is commonly "instantaneous," and that the response rate 
is usually steady. (After Skinner, 1938.) 

A second bar-pressing response usually followed soon after, 
in some cases immediately after, the first pellet had been 
seized and eaten, and the animal quickly developed a maximal 
rate of pressing and eating. The sample records in Figure 9 
illustrate this clearly. The two lower curves in this figure 
show the immediate development of a maximal response rate; 
the upper curves show a slight positive acceleration at their 
beginning, with a few responses occurring before a constant 
eating rate appears. 

It is clear from these curves that the rats quickly learned to 
press the bar when food resulted from the act. Indeed, if we 
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were unaware of the preliminary stages of the experiment 
the acclimatizing of the animal to the apparatus and the train- 
ing to approach the tray when the magazine was operated 
we might conclude from some of the records that the rat had 
already solved the bar-pressing problem when it was first pre- 
sented. Except in those animals that produced positively 
accelerated curves, the learning process was practically instan- 
taneous. And in no case was there anything comparable to the 
gradual and irregular progress that typified the behavior of 
Thorndike's cats. 

Skinner called his study an experiment "On the Rate of 
Formation of a Conditioned Reflex." It was obvious to him, 
however, that the rat's behavior could not adequately be de- 
scribed in terms of the conventional Pavlovian paradigm. A 
number of insurmountable barriers stood directly in the path 
of such an analysis. A practical exercise will convince you of 
the difficulties in applying Pavlov's principle to the bar-press- 
ing situation. Construct for yourself the paradigm, after the 
model in Chapter 2, labeling each S and R appropriately to 
show how the reflex is established. What are the two reflexes 
with which you begin? Where is the third reflex, the condi- 
tioned one? How does the 'conditioned' stimulus come to act 
as a substitute for the 'unconditioned* in eliciting the re- 
sponse to the latter? Be sure that you limit yourself to observ- 
able, rather than purely hypothetical, stimuli and responses. 

As an outcome of his own struggle with this problem, 
Skinner proposed, in 1935, and again in 1937, that we rec- 
ognize two types of conditioning: Type S and Type R. Type S 
is no more than the classical Pavlovian conditioning, in which 
reinforcement is always related to the presentation of a 
stimulus tor example, food is given when a tone is sounded. 
Type R, which is represented in the acquisition of bar-press- 
ing behavior, involves a relation between reinforcement and 
a specific response thus, food is given when the bar is pressed. 

A paradigm suitable for Type R conditioning is shown 
below, along with the familiar Type S schema. A comparison 
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of the two will help you to understand some of the ways in 
which the two types differ. 

TYPE R: s R (bar-pressing) *-S (food) >-R (eatTng) 

(The arrow in the paradigm "ls followed by.") 
TYPE S: S(food) ^ R (salfvatlon) 



r (ear-twitchtag) 



Type S conditioning, as we have seen, involves the elicita- 
tion of a response (salivation) by an identifiable conditioned 
stimulus (tone) that is under the experimenter's control. In 
Type R conditioning, the specific stimulus that initially 
evokes the response (bar-pressing) cannot be identified. This 
is indicated by the small 5 of the Type R paradigm. We need 
not assume that bar-pressing has no cause or that it can bear 
no relation to environmental stimuli; we shall see, in Chap- 
ter 5, that stimuli may "set the occasion" for this response. 
But, for all practical purposes, the response just occurs, is 
initially emitted, without relation to any specifiable stimulus 
agency. 

Type S conditioning involves stimulus substitution and the 
formation of a new reflex. The tone, in our example, comes 
to act as a substitute for food in eliciting salivation; and tone- 
salivation is the new reflex. In Type R conditioning, however, 
there is merely the strengthening of a reflex that already exists 
in the organism's repertory. Bar-pressing, for instance, occurs 
with some frequency prior to any reinforcement with food. 
At any rate, no substitution is involved and no new stimulus- 
response relation is formed. 

Type S conditioning prepares the organism for reinforce- 
ment. The tone comes to elicit salivation in advance of the 
food, paving the way, as it were, for its digestion. Type R 
conditioning procures or produces the reinforcement bar- 
pressing provides the rat with a food pellet. The two processes 
may take place concurrently, although our observation is 
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usually limited to one of them. Pavlov himself noted that 
Type S conditioning was commonly accompanied by "motor 
reactions" of head- turning and the like which we would now 
ascribe to the development of a Type R conditioning. More 
recently, another investigator (Brogden, igsgb) has shown that 
when dogs are reinforced with food for making a leg move- 
ment (Type R conditioning) in the presence of a tone, there 
is the simultaneous development of conditioned (Type S) 
salivation. 

With respect to this last distinction, a simple example may 
not be superfluous. A hungry boy, home from school, is met 
at the door with the odor of freshly baked cookies. In accord- 
ance with his history of Type S conditioning, his 'mouth 
waters/ preparing him for what may follow. But nothing will 
follow unless he has been conditioned in Type R fashion to 
make his way to the kitchen and exhibit the verbal or other 
behavior which, in the past, has been productive of cookies. 

The Law of Operant Conditioning 

These differences between Type S and Type R condition- 
ing are associated with a broader distinction, mentioned 
briefly in the preceding chapter, between two fundamental 
classes of behavior. One of these, which includes all those 
responses, conditioned or unconditioned, that are elicited by 
known stimuli, we called respondent. The other class, com- 
prising all those responses that are emitted more or less 
independently of identifiable stimuli, we may now call 
operant. The spontaneous movements of an infant organism, 
human or otherwise, are mainly of the latter type; and so are 
the "voluntary" acts of human beings. In fact, most of our 
behavior in the routine affairs of everyday life is clearly 
operant, in that it operates or acts upon the environment to 
produce the satisfaction of our basic needs. Respondent be- 
havior is much less commonly observed and seldom, it ever, 
operates upon the environment to produce anything. 

Operant behavior is conditioned primarily, if not exclu- 
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sively, in Type R fashion; respondent behavior is usually 
conditioned in Type S fashion. Hence, when its suits our 
convenience, we may speak of Type R as operant condition- 
ing, in the same way that we speak of Type S as respondent. 
Again, just as we refer to any single example of respondent 
behavior as a respondent, we shall refer to each example of 
operant behavior as an operant. 

We have seen, in Chapter 2, that the strength of a respond- 
ent is commonly measured in terms of latency and response 
magnitude. Neither of these measures is satisfactory in deter- 
mining the strength of an operant. Latency can have no 
meaning in the absence of an identifiable stimulus from 
which to measure the S-R interval; and the magnitude of an 
operant response does not change during conditioning in the 
orderly manner that typifies the respondent. The amount of 
saliva secreted by a dog in response to a tone may increase 
gradually with successive tone-food combinations, but the 
force of a bar-pressing response may be as great on its first 
appearance as it is on its fifty-first, and it may fluctuate 
throughout a long series of emissions. 

Our best measure of operant strength is frequency of occur- 
rence. An operant is strong when emitted often within a given 
period of time; it is weak when emitted rarely. We have, in a 
sense, assumed this already, in the case of bar-pressing: a 
steady, high rate of responding implied a strong response- 
tendency, whereas a slow, uneven rate implied a weak one. In 
the case of a respondent, frequency is a useless measurein 
fact, no measure at all since the response rate is determined 
solely by the rate at which the eliciting stimulus is presented 
to the organism. 

"If the occurrence of an operant is followed by the presen- 
tation of a reinforcing stimulus, the strength is increased/' 
(Skinner, 1938.) We can now begin to grasp the significance 
of this statement of the principle of Type R conditioning. 
Bar-pressing is an operant. It occurs with a certain low fre- 
quency prior to any experimental procedures that we may 
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apply. Its strength is increased when it is followed by rein- 
forcement. Increased strength means merely that it occurs 
with higher frequency than it did before. 

Table II 
COMPARISON BETWEEN TYPE S AND TYPE R CONDITIONING 



Type S 
Paradigm: 

S, r^ 


Type R 

Paradigm: 

0% ... .ft ^.5 (reinforcement) 


Response is elicited. 


Response is emitted. 


Stimulus substitution. 


No substitution of stimuli. 


Formation of new reflex. 


Strengthening of reflex already in 
repertory. 


"Preparation" by conditioned stim- 
ulus for the unconditioned rein- 
forcement that follows. The re- 
sponse does not manipulate the 
environment. 


Response "procures" the reinforce- 
ment. The response "operates" on 
the environment. 


Commonly, if not always, is medi- 
ated by the autonomic nervous sys- 
tem, involving smooth muscles and 
glands. 


Mediated by somatic nervous sys- 
tem, involving skeletal muscles. 


Usually measured in terms of re- 
flex latency or magnitude. 


Usually measured in terms of re- 
flex rate; sometimes, latency. 



Operant Conditioning and the Law of Effect 

When you compare the work of Skinner with that of 
Thorndike, you may be impressed by the numerous dissimi- 
larities. The two men used different species of animals, differ- 
ent apparatus, and different experimental procedures. Yet, 
when you consider that both situations required a manipula- 
tion of some environmental object; when you note that the 
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presentation of food was in each case contingent upon this 
manipulation; and when you compare the principle of 
Type R conditioning with the law of effect, you may notice 
a striking agreement. Both formulations emphasize the 
influence of the outcome of a response upon its strength: 
Thorndike calls it "satisfaction" and Skinner speaks of a 
"reinforcing stimulus." Skinner's formula seems to be the 
narrower of the two, since it contains no equivalent of Thorn- 
dike's "discomfort," yet Thorndike himself later came to 
discount the weakening effect of discomfort a point to which 
we shall return later. 

There is still another similarity. In spite of Thorndike's 
emphasis upon a connection or bond between situation and 
response, it is perfectly clear that he does not refer to the 
Pavlovian type of connection. He would have been the last 
to suggest that the loop-pulling or other manipulative be- 
havior of his cats was elicited in the same way that food elicits 
salivation, a shock elicits foot-withdrawal, or a cinder in the 
eye elicits tears. 

This last point requires some elaboration. Throughout 
much of the present chapter, we have underlined the fact that 
a large proportion of an organism's behavior is emitted rather 
than elicited, and is conditioned in Type R rather than 
Type S fashion. We may have led you to think that responses 
like loop-pulling and bar-pressing can have no relation what- 
ever to stimuli. If such was your impression, it should be 
corrected. Operant behavior, however spontaneous in its 
initial occurrence, very soon becomes associated with stimuli. 
The cat that has learned to pull the loop, or the rat that has 
learned to press the bar, reacts to stimuli or stimulus com- 
binations, even if we cannot specify them completely. In the 
absence of a loop or bar, the animal seldom paws the air. But 
these stimuli are not eliciting: they are not related to their 
responses in the same way that salivation is related to food or 
foot-withdrawal is related to electric shock. It is just that the 
responses are more likely to occur in the presence of such 
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objects. To use an expression quoted earlier, these stimuli 
set the occasion for responses. Later on, when we consider this 
matter in more detail, we shall refer to them as discriminative 
stimuli. 

The Runway Technique 

In the discovery and demonstration of basic principles, 
everything depends upon the kind of method that we employ. 
We seek, first, to find lawful relationships between known 
variables. This is the aim of all scientific endeavor. But, at 
the same time, we hope that our findings can be related to 
each other, within an integrated whole, and that our general- 
izations will apply to less restricted experimental situations 
than the one from which they were initially drawn. In these 
respects the bar-pressing technique has proved especially use- 
ful, as you will see again and again in the pages to come. It 
is, however, by no means the only method to which experi- 
mental psychologists have appealed in their attempts to 
analyze the behavior of organisms. 

A relatively simple means of studying operant behavior, 
and one in which Type R conditioning is readily apparent, 
is the runway method. In this procedure, as recently employed 
by Graham and Gagn (1940), hungry rats are used as sub- 
jects. After several periods of acclimatization to experimental 
conditions, the animal is placed in a starting-box, the sliding 
door of which opens to an elevated wooden pathway, three 
feet long and three-quarters of an inch wide. At the other end 
of the pathway is a food-box, identical in its dimensions with 
the starting-box. The rat's task is merely that of running from 
the starting-box to the food-box when the door of the former 
is opened by the experimenter. Upon entering the food-box, 
the door of which is then closed behind him, he is reinforced 
with a bit of food. When the food is eaten, the boxes are 
carefully interchanged and, after a pause for equalizing the 
between-run intervals and baiting the new food-box, the 
procedure is repeated. 
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FIG. 10. A version of the Graham-Gagne* apparatus. (After Raben, 1949.) 

Progress in this task is measured in terms of the time elaps- 
ing between the opening of the starting-box door and the rat's 
passing of a point four inches along the runway. This time 
interval was found by Graham and Gagn to decrease on suc- 
cessive runs. For a group of 2 1 animals, the average (geomet- 
rical mean) value obtained on the first run was 71 seconds. 
The second trial required only 17 seconds and, by the fif- 
teenth run, the low value of 2.8 seconds was reached. These 
data are represented in Figure 1 1. 

It is obvious, in this study, that operant conditioning 
occurred, through the reinforcement of the running response 
with food. Yet the situation is different in certain important 
respects from the one used in bar-pressing studies. For ex- 
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ample, in the Graham-Gagn method, the experimenter, as 
well as the subject, determines the frequency of the running 
response. Only at certain intervals is the response made pos- 
sible. This is essentially 
the trial-by-trial proce- 
dure employed by Thorn- 
dike in his problem-box 
experiments; and Thorn- 
dike, too, used a time- 
measure of progress. It is 
interesting to note that, 
although the curve of 
Figure 11 is an averaged 
record of 21 rats, it re- 
sembles closely that of 
Figure 5 (page 38), ob- 
tained by Thorndike with 
a single cat. 

Another point of differ- 
ence between the runway 
and bar-pressing tech- 
niques lies in the fact that 
the runway requires con* 
ditioning of a series or 
chain of responses (see 
Chapter 7) that take more 
time and are presumably 
Fie. 11. A plot of average starting greater in number than 
times on successive trials by a group in the case of bar-pressing, 
of rats on the Graham-Gagne' appara- ^ - , 

tw. (From data provided by Graham Consequently, the rein- 
and Gagne", 1940.) forcement of the bar- 

pressing response is more 

immediate and direct than the reinforcement for leaving the 
starting-box when the door is opened. It is probably for this 
reason that the whole runway performance is learned more 
gradually than is bar-pressing. 
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The Maze Technique 

You have probably recognized that the bar-pressing tech- 
nique is actually a simplified form of the problem-box 
method, in which the trial-wise procedure is replaced by the 
more useful "free operant" arrangement that is, the animal 
determines his own response rate. Similarly, the runway 
method may be considered as the final stage of another impor- 
tant line of development in the investigation of operant 
behavior. In 1901, when Thorndike was busy with his 
problem-solving studies at Columbia, W. S. Small, at Clark 
University, was exploring the trial-and-error behavior of rats 
with a device that was soon to enjoy tremendous vogue among 
American psychologists. Even today, when the popularity of 
this device has greatly decreased, the beginning student of 
psychology generally expects any reference to rats to be fol- 
lowed by talk about mazes. 

Interest in mazes or labyrinths is understandable. The early 
Greek equivalent of "labyrinth 1 * was applied to intricate 
underground passages and networks of chambers, which never 
fail to stir the imagination. From the great Egyptian laby- 
rinth, described by Herodotus as containing 3,000 chambers, 
to the decorative garden mazes of eighteenth-century France 
and England, and the amusement-park mazes of modern 
times, men have been intrigued by such devices. It is not 
strange that Small, in his search for a task the mastery of 
which would be slow and measurable for animals that spend 
most of their lives in dark and winding passages, should have 
hit upon this form of apparatus. 

The maze that Small built was a crude, wire-mesh affair, 
with a sawdust-covered floor. In design (see Figure 12) , it was 
modeled after the famous hedge maze on the grounds of 
Hampton Court Palace in England. Like most mazes used 
since, it comprised a series of straight-aways, turns, choice- 
points, and blind alleys, with a reward at the end of the route. 
Progress can be shown by the reduction of running-time on 
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successive trials, or in the number of blind-alley entrances. 
Small's own studies were not extensive, but suggested sev- 
eral lines of research. He noted that rats decreased their 
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FIG. 12. Design of the Hampton Court maze used by Small. The animal 
starts at E t and finds food at F. (From Small, 1901.) 

running-time and errors with continued practice; that they 
adopted a short-cut provided by the pattern (at the fourth 
choice point); that lessened hunger brought greater variabil- 
ity of behavior; and that they seemed to depend less upon 
sight and smell than upon touch or movement (propriocep- 
tive) cues. All these matters were subjected to later study by 
other investigators, who improved and standardized the train- 
ing procedure and developed new and more reliable appara- 
tus (see Figure 13) . Maze units were equalized in length and 
increased in number; one-way doors were introduced to* 
prevent retracing of paths; extra-maze distractions were elim- 
inated; motivation was more rigorously controlled; and so 
on. Such studies were undertaken to determine the influence 
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upon maze-learning scores of such factors as age, sex, previous 
maze experience, sense-organ participation, distribution of 
practice, and brain destruction. Maze performance was, thus, 
the dependent variable or index reflecting the effect of what- 
ever independent variable the experimenter chose. 




Fie. 13. A typical pattern of the Warner- Warden maze. (From C. J. 
Warden, T. N. Jenkins, and L. H. Warner, Comparative psychology, 
Vol. I. Copyright 1935 by The Ronald Press Company.) 

Along with this development came the hope that an analysis 
of "learning" could be made with this useful instrument. 
. , __, Unfortunately, this 

I j I hope was not realized. 

j 1 j it gradually became 

^""^ clear that maze-learn- 
ing was an exceedingly 
complicated affair, and 
that the maze itself was 
not the simple device 
that it had seemed to 
be. Even when atten- 
tion was centered upon 
the behavior of a rat at 

Fie. 14. A T-maze with a single choice a single choice point, 
point. The animal starts at E, and rein- 

were too great for the 
kind of description 
most desired by the scientist. 
We shall return to the problem of maze-learning in a later 



forcement is located at one of the end- 
boxes (G). 
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chapter, after you have become acquainted with the concepts 
required for its explanation. For the present, we may limit 
ourselves to the statement that, for a close analysis of be- 
havior, the maze is useful only when reduced to its very 
simplest formthat is, when it becomes no more than a run- 
way or a single T. 

Quick Learning 

It was noted, earlier in this chapter, that the bar-pressing 
response of a white rat may be conditioned with a single rein- 
forcement; and two cases of this were pictured in the cumula- 
tive response curves of Figure 9. This is not an unusual 
finding. Suppose you were to use a modification of the Skinner 
apparatus (see Figure 8) in which the animal's living-cage 
is transformed into a response chamber by the mere insertion 
of a bar. Under such circumstances, with hungry animals, and 
with only the briefest of acclimatizing and pre-training pe- 
riods, "one-trial learning" is commonly observed. Seldom 
does conditioning fail to occur within a few minutes after the 
bar is first presented. The rat, one might say, "gets the point" 
immediately, often reaching a steady response-rate after the 
discharge of but one pellet into his tray. 

This kind of behavior has sometimes been likened to the 
sudden, "insightful" achievements of animals higher than 
the rat in the phylogenetic scale. For example, Wolfgang 
Kohler (1925) presented chimpanzees with a variety of fruit- 
getting problems stacking boxes, using sticks, taking detours, 
and the like. He noticed, and described vividly, many in- 
stances in which the apes, after a preliminary survey of the 
situation, attained their objectives quickly and conclusively, 
in almost-human fashion, without any discernible "trial and 
error." Such results are in sharp contrast with the slow, step- 
wise progress of rats in mazes and cats in problem-boxes, and 
Kohler argued that his own animals, by virtue of the experi- 
menter's choice of problem situation, were permitted to show 
their true intelligence or use of "insight." 
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Today we can say that the sudden or one-trial mastery of 
any problem is due to one, or both, of two factors: (i) the 
similarity of the problem to one that was solved on an earlier 
occasion, or (2) the simplicity of the problem itself. Kohler's 
findings are probably attributable, in large part, to the first 
of these factors. He took samplings of behavior, like those 
taken in human intelligence tests; and the success of his 
animals may be ascribed to their past history of conditioning 
(see our treatment of similarity in Chapter 5). Rapid acquisi- 
tion of the bar-pressing response, on the other hand, is trace- 
able primarily to the relatively uncomplicated nature of the 
task. Bar-pressing has considerable unconditioned strength to 
begin with: it occurs with more than zero frequency prior to 
any reinforcement that we provide. Moreover, the specific 
movements involved are very few in number, and this is a 
good reason for calling a problem "simple." We could easily 
complicate matters in the bar-pressing situation, say by de- 
manding a greater force of response, or by placing the food 
tray at some distance from the bar, so that the animal would 
have to leave the vicinity of the tray to obtain the next rein- 
forcement. By requiring this additional activity, we could 
undoubtedly lengthen the learning time, unless the rat had 
had experience in getting food under similar circumstances. 

In the case of human beings, the solution of problems may 
be speeded up by a special set of conditions. It is too early for 
us to consider these conditions here, but we may note that 
the possession of language is often of help in reducing the 
time required or the number of errors made in the mastery of 
certain tasks. Thus, the person who 'verbalizes* his choice- 
responses while finding his way through a complicated maze 
will make a performance record definitely superior to that of 
a person who does not employ such aids (Warden, 1924). 
Apparently, verbal sequences can be memorized by the human 
learner faster than the purely manual or locomotor pattern. 
These verbal sequences arise from his movements and then 
come to direct the chain in a discriminative fashion. In some 
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instances, the rate of improvement is so dramatic as to obscure 
the fact that essentially the same basic principles are involved 
in verbal as in non-verbal behavior. 

Positive and Negative Reinforcement 

Thorndike, in his 1911 statement of the law of effect, spoke 
of the strengthening effect of "satisfaction'* upon the bond 
between situation and response. Today, avoiding controversy 
about the nature of "satisfaction," we would say that the food 
he gave to his cats for opening a problem-box door was posi- 
tively reinforcing. On the observational level, this would 
mean exactly what Thorndike meant that the effect of the 
food was to increase the frequency of the response that pro- 
duced it. We know, too, that water, for a thirsty animal, 
would have had a similar effect. Food and water belong to a 
class of positive reinforcers. 

This is not all that Thorndike said. He spoke also of the 
weakening effect of "discomfort 1 * upon situation-response 
connections. Certain stimuli (electric shocks, loud sounds, 
strong lights, etc.) serve to decrease the frequency of responses 
in the wake of which they follow. Nowadays, we call them 
negative reinforcers, but they are not best defined in terms of 
their weakening function. By 1932, Thorndike himself 
argued that "rewards" and "punishments" are not opposed 
to each other in the manner implied by his earlier formula- 
tion; and we shall offer evidence, in the next chapter, to show 
that the weakening effect of negatively reinforcing stimuli is 
not permanent. 

Another, and probably a better, way of handling the matter 
is to define positive reinforcers as those stimuli which 
strengthen responses when presented (e.g., food strengthens 
bar-pressing or loop-pulling behavior), and negative rein- 
forcers as those which strengthen when they are removed. 
Experimentally, a number of responses have been condi- 
tioned in animals entirely on the basis of escape from, or 
reduction of, certain stimulus conditions. Mowrer (1940) 
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showed, for example, that a panel-pushing response could be 
rapidly set up in white rats when it was reinforced by the 
removal of electric-shock stimulation; and Keller (1942) 
obtained similar results when he conditioned bar-pressing in 
rats by the simple device of turning off a bright light for sixty 
seconds whenever the response was emitted in its presence. At 
the everyday level, too, we often see the results of this kind of 
strengthening. We move out of the heat of the noonday sun; 
we close the window that shuts out the roar of traffic; we take 
off the shoes that pinch our feet; and we kindle the fire that 
will warm our hands. In each case, we perform an act that has 
previously been strengthened because it produced the cessa- 
tion of a "noxious" or "annoying" stimulus. 

We have, then, two ways of defining negative reinforcers: 
the first is in terms of the weakening effect they have when 
presented; the second is in terms of the strengthening effect 
of their removal. The effect is upon operant behavior; an 
operant is weakened in one case and strengthened in the 
other. Yet, it should be noted that the same operant cannot 
simultaneously undergo both changes. A strong light, applied 
briefly whenever a bar-pressing response occurs, will depress 
the frequency of the pressing (Schoenfeld, 1947); if the same 
light is continuously applied until a bar-pressing response 
occurs, and is then immediately extinguished, the response 
will be strengthened. But the response cannot produce and 
remove a stimulus at one and the same time. 

O per ant-Respondent Overlap 

Coincidental with their effect upon operant behavior, nega- 
tively reinforcing stimuli may exercise another function: they 
may serve as eliciting stimuli for respondent behavior. Two 
cases arise: (i) elicitation may accompany the weakening 
function of a negative reinforcer, as when a strong electric 
shock inhibits bar-pressing and, at the same time, elicits foot- 
withdrawal, squealing, blood-pressure changes, and so forth; 
and (2) it may accompany the strengthening function, as 
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when a shock elicits a foot-withdrawal, which is then strength- 
ened operant-wise by the shock-removal. The second case 
is represented in the accompanying paradigm. 

S (shock) -R (flexion) >- S (shock-removal) 

Another combination of elicitation and operant strengthen- 
ing is seen in an experiment reported by Konorski and Miller 
(1937). Using an electric shock just intense enough to elicit 
a leg-raising response in a dog, they gave food to the animal 
after each elicitation. The scheme, then, would be this: 

S (shock) *> R (flexion) >> 3 (food) 

Here the shock elicits flexion and the flexion 'produces' food. 
We are therefore prepared for the finding that "after a few 
reinforcements the animal starts to raise its leg independently 
of electrical shock as soon as it finds itself in the given 
experimental situation/' (Konorski and Miller, 1937, p. 266) 
Such cases of overlap may be hard to understand unless you 
perceive that, with the use of shock, one elicits responses of 
those very muscles which are used in operant behavior. 

A question often occurs in connection with operant-re- 
spondent overlap: since a skeletal-muscle or 'motor' respond- 
ent, such as foot-withdrawal to shock, may be conditioned in 
a Type R manner, cannot an autonomic respondent, like 
salivation or the galvanic skin response, be strengthened by 
the same procedure? Very little information exists on this 
point, but the answer is probably No. One may apparently 
strengthen an operant which will in turn produce the condi- 
tioned stimulus for a respondent, as in the Hudgins and 
Menzies experiments (see pages 25-27), but we do not there- 
by liberate the respondent from its dependence upon an 
eliciting stimulus. As for the reverse effect, it is not clear, at 
this time, that a motor respondent can be conditioned by a 
Type S procedure. The possibility of sensitization (see page 
29) or some kind of operant reinforcement, such as shock- 
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removal, has seldom been eliminated in experiments designed 
to test this matter. 

Related to the latter point are some unpublished observa- 
tions by Reinwald (1941) . A dog was first trained, by a Type 
R procedure, to lie quietly on his right side, with his left leg 
suspended by a harness in such a way as to permit its un- 
impeded movement whenever the patellar tendon of the leg 
was tapped lightly with a small hammer. Then, in an attempt 
to condition this patellar reflex or knee jerk, taps to the 
tendon were combined with visual stimulation a moderately 
intense light just in front of the animal's eyes. The procedure 
thus conforms to the Pavlovian paradigm: 



(tendon tap) >. R (knee Jerk) 




S (light) === >. r (unidentified) 

One thousand combinations of the conditioned and uncondi- 
tioned stimulus were presented to the dcg over a period of 
many weeks, with the following result: no evidence of Type S 
conditioning was obtained. Although the tendon tap regularly 
elicited the knee jerk, the light never came to exercise the 
least effect upon the response. Results of this sort should not 
be hastily generalized, since the patellar-reflex findings may 
not be typical, but they do suggest a critical scrutiny of the 
alleged cases of respondent conditioning when somatic or 
"motor" lesponses are involved. 

The Importance of Operant Conditioning 

The principle of operant conditioning may be seen every- 
where in the multifarious activities of human beings from 
birth until death. Alone, or in combination with the Pav- 
lovian principle, it is involved in all the strengthenings of 
behavior with which we shall be concerned in this book. It is 
present in our most delicate discriminations and our subtlest 
skills; in our earliest crude habits and the highest refinements 
of creative thought. It accounts, in large part, for our ab- 
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normal 'fixations' as well as our normal 'adjustments'; for our 
parades of power and our shows of weakness; for cooperation 
no less than competition. It may be seen in our friendly rela- 
tions with, and our withdrawals from, our fellows; in our 
expressions of bigotry and toleration; in our virtues as well 
as our vices. 

We do not expect you to accept this appraisal without ques- 
tion or reservation at this time. We have scarcely begun our 
analysis of behavior. Only a few experiments have been cited, 
most of them drawn from the laboratories of animal research. 
Other principles and other findings have yet to be considered, 
and these in turn must be related to the ones already treated. 
Nevertheless, so basic and so far-reaching is the law of operant 
conditioning that, even now, you should be able to find, in 
your own experience, many illustrations of its action. Later 
on, as we deal with ever more complex problems, you will be 
more, rather than less, aware of its explanatory power. 

NOTES 

We have ignored, in our discussion of Thorndike's work, his many 
studies of connection-formation in human beings, where the mere 
announcement of Right or Wrong by the experimenter was used as a 
reward or punishment for some response, usually verbal, on the part 
of the subject. This omission will be understood after you have seen, 
in Chapter 8, how words and other stimuli, not initially reinforcing, 
may come to exercise such an effect upon operant behavior. Also, in 
our treatment of the runway and the maze, no mention has been made 
of the way in which results from these devices have exemplified, even 
clarified, the operation of other principles than the one to which this 
chapter is devoted. This, too, will be corrected as we go along. 

The way in which we have treated the concept of "insight" deserves 
a word of qualification. Kohler's fascinating reports of chimpanzee 
behavior (The mentality of apes, 1925) were once widely regarded as 
convincing demonstrations of the inadequacy of conditioned reflex 
theories. His Umweg or detour method of experimentation, in which 
animals were required to use crude tools (sticks, strings, etc.) or round- 
about approaches to their food-objectives, was hailed by some as the 
best of all instruments for analyzing 'intelligent' behavior. Today, 
however, we see that Kohler's method left much to be desired, since 
it involved no study of the relation between known variables, as pre- 
supposed in scientific inquiry. His observations did little more thai* 
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suggest problems for further research. Conditioning principles may be 
expected to apply to them as well as to any other instance of everyday 
problem-solving. 

The distinction between operant and respondent conditioning, 
although anticipated in the writings of many psychologists, was not 
sharply drawn until 1935, in an important paper by Skinner. Since 
then, it has been endorsed and discussed by a number of authorities 
in this area of research (Schlosberg, 1937; Hilgard, 1937; Razran, i939a; 
Hilgard and Marquis, 1940; and Mowrer 1947). The terms classical 
and instrumental, employed by Hilgard and Marquis, are rather widely 
used equivalents, respectively, of Type S and Type R conditioning. 

Hilgard and Marquis have described four categories of instrumental 
(operant) conditioning, (i) Reward training, in which responses are 
strengthened through the presentation of positively reinforcing stimuli; 

(2) escape training, in which they are strengthened through the termi- 
nation or reduction of "noxious" (i.e., negatively reinforcing) stimuli; 

(3) avoidance training, which is accomplished when "the learned reac- 
tion prevents the appearance of a noxious stimulus"; and (4) secondary 
reward training, in which strengthening results from presenting stimuli 
which have previously accompanied positive reinforcements. You will 
note that we have already considered the first two of these categories; 
avoidance behavior and secondary reward training will be treated in 
Chapters 8 and 9. 

Some psychologists prefer not to use the term conditioning in con- 
nection with the strengthening of operant responses, bar-pressing or 
otherwise. They speak of trial-and-error, law-of-effect, or, simply, effect 
learning. We need not object to these terms, as long as the reference 
is clear, but we think it more appropriate to adopt the notion of two 
types of conditioning: (i) the Pavlovian case, in which a reinforcing 
stimulus (e.g., food) is provided in connection with the presentation of 
a stimulus (e.g., tone); and (2) the Thorndikian case, in which the rein- 
forcing stimulus is contingent upon a response. The important thing, 
for us, is that there are two reinforcement contingenciesone with an S 
and one with an R. 

In connection with the concept of positive reinforcement, it has been 
argued (e.g., by Hilgard and Marquis, 1940) that our quoted principle 
of operant conditioning on page 50 is circular. We use reinforcing 
stimulus to explain the strengthening of an operant; but we define a 
reinforcing stimulus in terms of its strengthening effect. The fact of the 
matter is that some stimuli strengthen the responses they follow, and 
others do not. To the former alone do we apply the term reinforcing. 
A less debatable wording of our principle might run as follows: There 
are stimuli which have the power to strengthen the operant responses 
that produce them. This strengthening may be termed 'operant condi- 
tioning! and the stimuli may be referred to a class called 'reinforcing/ 
This is the essence of our quoted principle, and the circularity is 
superficial. 



EXTINCTION AND RECONDITIONING 

NEVER suffer an exception to occur Each lapse is like the letting fall 

of a ball of string which one is carefully winding up; a single slip undoea 
more than a great many turns will wind again. 

William James, on the making and breaking 
of habits. Principles of Psychology, 1890 



The Adaptability of Behavior 

So long as life endures, a creature's behavior is a clay to be 
molded by circumstances, whimsical or planned. Acts added 
to it, and other acts which fall out, are the means by which 
it is shaped. Like the two hands of an artisan, busily dabbing 
and gouging, are the two processes, reinforcement and ex- 
tinction. 

Reinforcement is the indispensable condition for strength- 
ening reactions. But, as we know, its effect is exercised in the 
presence of all of the stimuli existent at the time it occurs. 
Some of these stimuli (such as the day's temperature, passing 
odors, and momentary illumination) may be irrelevant in 
that they are not the ones necessarily correlated with rein- 
forcement. They may, on later occasions, be quite different 
while the reinforcement continues to be associated with only 
one stimulus. If irrelevant stimuli were perpetually to arouse 
the response, we would have a picture of sheer biological in- 
efficiency: energy spent uselessly, time lost, and impaired 
chances for survival. The adaptability of behavior to critical 
stimuli depends on the possibility of diminishing the response 
to non-critical ones. Such a decline in reaction strength foi- 
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lows the withholding of reinforcement. This is called extinc- 
tion, and is the process we have now to examine. 

Respondent Extinction 

Just as a Type S reaction is strengthened by the presence of 
the unconditioned stimulus, so it is weakened by its absence. 
Suppose we halt the concomitant presentation of conditioned 
and unconditioned stimuli, but continue to present the con- 
ditioned stimulus. When this is done, and we measure the 
magnitude of the response, we find that on successive tests the 
response to the conditioned stimulus decreases and eventually 
reaches zero. This is what Pavlov called experimental extinc- 
tion, and the principle may be stated as follows: 

// a conditioned reflex of Type S is elicited without the 
presentation of the reinforcing stimulus, its strength 
decreases. 

A response, then, is said to be extinguished when, as a con- 
sequence of its dissociation from reinforcement, the condi- 
tioned stimulus has lost its eliciting power. 

The table below shows the phenomenon as reported from 
Pavlov's laboratory. A dog was conditioned to salivate at the 
sight of meat-powder, through many trials in which he was 
shown the powder and then allowed to eat some. Extinction 
was then carried out, with thirty-second showings of powder 
which were never followed by eating. In a few trials, the 

Table III 

EXTINCTION OF A CONDITIONED SALIVARY REFLEX 

(Data from Pavlov, 1927) 
Successive Number of cc. of saliva 

unrein forced secreted in each thirty- 

stimulations second period 

i i.o 

* .6 

3 -3 

4 -i 

5 .o 

6 .o 
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conditioned stimulus lost its power to elicit salivation. You 
will note that some extinction occurred with each unrein- 
forced stimulation, and that the drop in response went rap- 
idly at first, and then more slowly. This progression is typical 
of the various Type S connections which have been set up and 
broken down in many laboratories since Pavlov's early 
experiments. 
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FIG. 15. Average extinction curve of the conditioned galvanic skin 
response obtained from 20 human subjects. The ordinate is in per cent, 
so that for no preceding extinction trials (i.e., on the last conditioning 
trial) the amplitude of the GSR is given the value of 100 per cent. The 
unconditioned effect of the CS (tone) accounts for the curve's failure 
to drop to a zero level. (From Hull, 1943, based on data supplied by 
C. I. Hovland.) 

The actual speed of extinction depends on several factors. 
(i) Fully-conditioned reactions extinguish more slowly than 
those based on only a few reinforcements, and over-condi- 
tioned reactions more slowly than those just brought to full 
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strength. (2) Extinction seems to take fewer trials when un- 
reinforced elicitations come close together than when distrib- 
uted over a longer time. (3) Higher-order conditioned 
respondents are very susceptible to extinction: a few presen- 
tations of the stimulus without reinforcement from the 
preceding conditioned stimulus, and the reaction is at zero. 
(You will recall that higher-order Type S reflexes are difficult 
to establish, unreliable when obtained, and usually quite 
weak. Their inability to resist extinction is an added reason 
for discounting their importance in the life of an organism.) 

Spontaneous Recovery 

Extinction is not necessarily permanent or complete when 
the response has once reached zero magnitude. When the 
animal is again brought into the laboratory, the response to 
the conditioned stimulus usually reappears in some strength, 
albeit below the former maximum. This phenomenon is 
called spontaneous recovery and, although its origin is not 
well understood, it has regularly occurred in both Type S and 
Type R behavior. For example, the dog whose salivary ex- 
tinction is shown in Table III was tested again by visual pre- 
sentation of meat powder only two hours after this record 
was obtained. His response then measured .15 cc. of saliva. 
Greater recoveries than this have been found with other 
responses and under other circumstances. The spontaneous 
recovery of higher-order Type S reactions is negligible. 

Extinction following spontaneous recovery is faster than 
original extinction. If there is a second recovery, it is less in 
magnitude than the first and is extinguished more rapidly. As 
a rule, only a few extinction sessions are needed to overcome 
the diminishing recoveries and to reduce the response 
strength to a stable zero. 

Operant Extinction 

Conditioned operants are extinguished by severing the rela- 
tion between the act and the effect. As successive responses 
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fail to produce reinforcement, the recurrence of the response 
is less and less likely. The principle of Type R extinction 
may be put as follows: 

The strength of a conditioned operant may be dimin- 
ished by withholding its reinforcement. 

As in the case of operant conditioning, the principal meas- 
ure of operant extinction is the frequency of the response in 
time. The loss in strength is seen in a fallen rate of emission. 
This is portrayed clearly in the cumulative response curve of 
Figure 16. As responses come more and more slowly, the 
cumulative curve bends over and takes on a characteristic 
shape. 
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FIG. 16. Typical cumulative response curve for extinction of bar- 
pressing by a white rat following about 100 reinforcements. (After 
Skinner, 1938.) 

The extinction curve for a response hitherto regularly rein- 
forced (that is, with a reinforcement for each emission) is 
usually, if not always, rather uneven. It begins with a steeper 
slope (higher response rate) than that during regular rein- 
forcement, partly because responses are no longer separated 
by eating time, and partly because the animal is apt to attack 
vigorously the now-unrewarding bar. Thereafter, the curve 
is marked by wavelike changes in rate which distort it in 
detail but still permit the drawing of a smooth 'envelope* to 
describe the over-all trend. These bursts and depressions of 
response might be called emotional in character, the counter- 
part of more complicated frustrations and aggressions seen in 
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man. Consider, for example, the frustration-aggression pat- 
tern in the behavior of a child who struggles with knotted 
shoe-laces or an unyielding door, or the effect of an unrespon- 
sive listener in coldly dousing our dinner-party stories. 

Resistance to Extinction as a Measure of Strength 

A strong habit is one that tends to persist after the rein- 
forcement has been discontinued, whereas a weak one suc- 
cumbs more quickly. Taken by itself, emission rate under 
regular reinforcement is not a good indicator of operant 
strength. With bar-pressing for food, for example, rate is 
greatly affected by such incidental things as the size and hard- 
ness of the pellets which determine chewing time. On the 
other hand, resistance to extinction serves to disclose quite 
well what strength an act has acquired during a training 
period. 

But how to measure this resistance? We can use the total 
number of responses emitted in extinction, or the number 
required to reach some arbitrary extinction criterion such as 
the first five-minute or ten-minute interval in which no re- 
sponding occurs. The number of responses in a complete 
extinction is generally more satisfactory, since any short-time 
criterion excludes possibly important data contained in the 
rest of the extinction curve. On the other hand, extinction 
criteria have the advantage of being time-saving and experi- 
mentally convenient, and are therefore frequently employed. 

Resistance to extinction after regular reinforcement is 
governed by a number of factors, among which are the 
following: 

i. The number of reinforcements applied to the response. 
Williams (1938) trained five groups of rats to press a bar for 
food. The animals in each group were allowed a given num- 
ber of reinforcements before extinction was begun, the low- 
number group receiving but five reinforcements, and the 
high-number group, 90. In Figure 17 is plotted the mean 
number of unreinforced responses made by each group up to 



EXTINCTION AND RECONDITIONING 73 

a criterion of five minutes of no response. Resistance to ex- 
tinction is seen to increase with number of reinforcements 
up to a point beyond which additional reinforcements pro- 
duce very little increment in strength. (In mathematical 
terms, an asymptote of strength is approached.) 
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FIG. 17. Curve showing the relation between number of reinforce- 
ments and number of extinction responses. The number of responses 
made is taken as the measure of strength of conditioning. (After 
Williams, 1938.) 

2. The amount of reinforcement given for each response. 
This factor has recently been investigated by Zeaman (1949). 
Four groups of rats were given one run a day on a Graham- 
Gagn runway for twenty days, with a different amount of 
food reinforcement for each group (.2, .4, .8, and 1.6 grams). 
Zeaman was able to show that the amount of reinforcement 
received during training was related to the highest speed of 
starting (see page 54) that each group ultimately attained. 
The group given a small amount of food did not reach as low 
a starting-time level as the group given a large amount. These 
results accord with those from an early study by Grindley 
(1929) which indicated that chicks will traverse a runway 
faster for six grains of rice per trial than they will for one 
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grain. Similarly, Wolfe and Kaplon (1941) found that chicks 
learned a maze faster when a whole kernel of popcorn was 
given for each run than they did when only a quarter of a 
kernel was provided. At the end of the training period in 
Zeaman's experiment, when the starting-times had decreased 
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FIG. 18. Final strength achieved by the conditioned salivary reflex 
in the dog as related to the amount of food reinforcement used in the 
conditioning. One dog was used with four different CS's, each CS being 
reinforced by a different amount of food. The training schedule in- 
volved presentation of all four stimuli in random order, but each 
stimulus was always associated with its own amount of food. The 
curve shows the limit of response strength reached for each stimulus, 
and indicates that respondent as well as operant conditioning is influ- 
enced by the amount of reinforcement employed. (From Hull, 1943; 
data supplied by W. H. Gantt.) 

to a minimal value for each group, the running response was 
extinguished. Although extinction began with the groups at 
different levels, and although the number of non-reinforced 
trials was insufficient for a complete test, Zeaman's analysis 
suggests that an increase in amount of reinforcement goes 
with an increased resistance to extinction. 

3. The degree of motivation present during extinction 
and conditioning. You will see, in Chapter 9, that resistance 
to extinction depends upon the motivation (hunger, thirst, 
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etc.) present at the time. Suppose a group of animals, equally 
hungry, are conditioned with the same number of reinforce- 
ments. If they are then extinguished under different degrees 
of hunger, the hungrier ones make more responses than do 
the less hungry (pages 265-266). That is, resistance to extinc- 
tion is greater when extinction is carried out under high 
motivation than under low (Perin, 1942). Oddly enough, 
however, when groups of animals are conditioned under dif- 
ferent degrees of (hunger) motivation and extinguished under 
the same degree, the extinction responses do not reflect the 
differences in training conditions. A certain low degree of 
motivation is required for any conditioning, but added in- 
crements do not seem to give added strength (Strassburger, 

195)- 

Table IV 
THE EFFECT OF VARYING DRIVE LEVEL AT TIME OF CONDITIONING UPON 

THE STRENGTH OF CONDITIONING 

(Data from Strassburger, 1950) 



Number of 
Reinforcements 


Hours of Deprivation at Conditioning 
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128.1 

994 
45.0 
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The table summarizes the experimental design and findings. Each 
entry in the table represents one group of animals conditioned in bar- 
pressing under the indicated length of food deprivation and with the 
indicated number of reinforcements; the numerical value of the entry 
gives the mean number of bar-pressing responses made in the first hour 
of extinction. Ten to twelve animals were used in each group. Extinc- 
tion was carried out at the same drive level (23 hours of deprivation) 
for all groups. None of the values in any row is significantly different 
from the others in that row by statistical test. Thus, for a given number 
of reinforcements, the drive level at time of conditioning did not affect 
the strength of the response (as measured by resistance to extinction). 
Note, however, that there is a relation shown between the number of 
reinforcements and resistance to extinction. This latter finding cor- 
roborates that in Figure 17, page 73. 
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When Is a Response Extinguished? 

An operant must exist in some strength before it can be 
conditioned; it must be emitted once in a while at least, in 
order for us to reinforce it. This unconditioned rate of emis- 
sion may be called the operant level for that response, and it 
appears as part of the general activity of the organism. It 
determines the quickness with which a response can be rein- 
forced: if the act comes infrequently, there is a long wait; if 
the first reinforcement does not take hold, conditioning is 
inconveniently delayed. 

From the fact of operant level, it follows that an extin- 
guished response will not reach a zero rate, but will return to 
one that existed before reinforcement. Thus, the cumulative 
response curves for bar-pressing extinction in the rat will 
approach or reach a slope that approximates the one that 
existed prior to conditioning. For this reason, experiments 
which aim to compare numbers of responses in extinction 
after different kinds or amounts of training must take account 
of the operant level. Unconditioned emission rates must be 
determined for all animals before the experimental factor is 
introduced. The groups may be equated for prior level, or a 
correction may be applied to the final data. 

Spontaneous Recovery of a Type R Response 
Suppose that, on a given day, after conditioning, a period 
of extinction has reduced bar-pressing to a low rate. The 
next day's session, with all conditions the same, will give 
another, but smaller, extinction curve. This observation has 
been made in a number of studies. Youtz (1938), for example, 
extinguished bar-pressing in rats to a criterion of no response 
in twenty minutes. One day later, in a second extinction 
session, he obtained as much as 55 per cent of the responses 
emitted in the first. The amount of recovery diminishes, how- 
ever, with successive extinctions until none is discernible 
apart from that which arises from general activity. 
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Further consideration of this spontaneous recovery suggests 
that the number of responses 'recovered' does not add up to 
more than would be expected from an extension of the orig- 
inal extinction curve; in other words, if the extinction curve 
had been continued without interruption, the number of 
responses would have been about the same as that obtained 
from a spaced succession of shorter extinction periods. This 
may, or may not, be true; there is some evidence to the con- 
trary (Ellson, 1939; Keller, 1940). It does appear, however, 
that the longer the interval between original extinction and 
a later one, the greater the accumulation of unexpended re- 
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FIG. 19. A schematic curve illustrating how spontaneous recovery 
after a lapse of 8 hours might approach the extinction curve (dashed 
line) that would have been expected if extinction had not been inter- 
rupted. (After Skinner, 1938.) 

sponses, and the greater the apparent recovery. This effect 
may be quite pronounced, even for intervals measured in 
minutes, if extinction is interrupted very soon in the first 
session. At that time, responses are coming out at a good clip 
and, consequently, many will pile up in a short space of time. 
An example of this is afforded by Ellson's (1938) study in 
which extinction was carried out to a five-minute period of 
no responding. (Such a criterion is reached before the process 
is very far advanced, and many responses are still to come.) 
Subsequently, four groups of his animals were given recovery 
intervals of 5.5, 25, 65, and 185 minutes, after which extinc- 
tion was resumed to the same five-minute criterion. The aver- 
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age numbers of responses accumulated in the respective 
groups were 7.6, 14.4, 19.5, and 24.4. 

Extinction and Forgetting 

The beginner in psychology often takes extinction to be 
synonymous with forgetting. A close examination of forget- 
ting shows us, however, that matters are not quite as simple 
as this. Let us look, briefly, into the history of the problem. 
Back in 1885, Hermann von Ebbinghaus published a volume 
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FIG. 20. Curve of nonsense-syllable retention obtained by Ebbinghaus 
with the "savings method." This method takes the difference between 
original time taken to learn and relearning time after some period, and 
computes the "per cent time saved" (here called "per cent retained") 
by dividing the time difference by original learning time. Each period 
represents the learning and relearning of a different list of nonsense 
syllables. Ebbinghaus served as his own subject. The first three points 
plotted are for intervals of .33, i.oo, and 8.8 hours, respectively. (After 
Ebbinghaus, 1885.) 

entitled Ueber das Gedachtnis or, in English, On Memory. 
In this book he proposed methods for the quantitative study 
of human verbal learning and the retention of learned ma- 
terial, together with a large body of data he had painstakingly 
amassed in studying his own ability to learn and relearn series 
of nonsense syllables (rop, fim, zeb, etc.). We shall consider, 
in Chapter 7, the actual process of serial learning, and will 
have occasion to refer again to Ebbinghaus. In the present 
connection, we may note that one of the problems to which 
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he addressed himself was that of forgetting. The curve in 
Figure 20 shows how he found the amount of retention of 
nonsense syllables to depend upon the amount of time that 
passes between initial learning of a list and later relearning 
to a given degree of perfection. It indicates that the major 
loss in remembering occurs quickly after learning, but some 
small residue can be expected to survive for a long time. 

Despite Ebbinghaus' results and the supporting evidence 
later provided by other investigators, objections soon arose. 
We now know that his 'forgetting curve' is not broadly ap- 
plicable to other than nonsense material. Many observers 
have pointed to the recall of supposedly dead memories in 
dreams, hypnosis, and reveries. Psychiatrists have convinced 
us that, with suitable prompting, persons may recover mem- 
ories that had at first seemed hopelessly lost. Experimentalists, 
too, have added evidence which indicates that 'meaningful' 
material, such as prose and poetry, does not fade with time in 
the manner suggested by the Ebbinghaus curve. 

Scientifically, the meaning of forgetting should probably 
be restricted to the weakening of response which results ex- 
clusively from the lapse of time between conditioning and 
some later test of strength. Simple conditioned responses, 
operant or respondent, lend themselves to the study of this 
phenomenon. After the response is established, any length of 
time may be allowed to elapse before it is tested again and 
compared with its former strength. If forgetting takes place 
during the interval, the loss in strength should be reflected in, 
say, a decreased resistance to extinction of the response. The 
upshot of several studies has been that the mere disuse of a 
conditioned response yields very little diminution in strength 
over periods of months or years. Figure 21 presents aver- 
aged extinction curves of bar-pressing in two groups of rats. 
The upper curve was obtained on the day following the 
one in which 100 responses had been reinforced; the lower 
curve was obtained forty-five days after initial conditioning 
and the same number of reinforcements. Experimental ex- 
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tinction is, apparently, far more effective in weakening a 
response than is a period of passive disuse. Today, many 
investigators believe that, given ideal control over a creature's 
activity during the interval between learning and testing, a 
conditioned response would show no weakening at all. 
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FIG. gi. Average cumulative response curves for bar-pressing extinc- 
tion in two groups of rats extinguished after different periods of time 
following original conditioning. Only slight loss of strength results 
from the mere lapse of time. (From Skinner, 1938.) 

The seeming dilemma is not insoluble. The crucial differ- 
ence between the Ebbinghaus results and the conditioned- 
reflex results lies in the type of materials employed. Nonsense 
syllables, in long lists, memorized individually in a fixed serial 
order, are subject to much mutual confusion and blocking. 
The intervals between learning and relearning in Ebbing- 
haus' work were undoubtedly filled with verbal behavior of 
which nonsense syllables are special cases. These syllables 
probably cannot be isolated or protected from blending with 
other speech, and this would involve, in time, a loss of their 
identity. This is probably related to the fact that we 'forget' 
foreign languages and our school-day mathematics, but not 
how to swim or ride a bicycle. 

If we ask now whether there is such a thing as forgetting, 
the answer is probably No. A simple and distinct act probably 
does not expire through disuse; an intricate set of reactions 
may suffer, if not rehearsed, from internal attrition. If, in the 
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interval, other things are learned or practised which conflict 
with the original responses, 'forgetting' will be the result. 
The everyday use of the word forgetting does not, of course, 
distinguish between lapse of time and other factors that may 
account for a loss in retention. 

Regression: a By-product of Extinction 

Occasionally we see an adult behave, as we put it, "like a 
child 1 '; or a youngster who, seeing his parents' attention turn 
to a new infant, may once again wet himself, insist on being 
hand-fed, and refuse to sleep in a dark room or in his own bed. 
These and other instances have seemed to psychiatrists to 
illustrate in common a mechanism which they describe as a 
regression or going-back. They consider regression to be a 
retreat from one's more recently acquired behavior to that of 
an earlier period. From the critical writings on this subject, 
we may extract a central idea: If present behavior is not 
capable of getting reinforcement, one reverts to older forms 
of response which were once effective. An individual whose 
best efforts fail to reach a solution of his difficulties is thrown 
back upon other resources, and these resources are the ones 
he once used successfully but outgrew with maturation or 
social training. 

Clinical observations are always complex and it is dan- 
gerous to entertain over-simplified explanations of them. 
Nevertheless, the notion of regression has led to a search for 
a possible prototype which would give some degree of validity 
to the idea. Mowrer (1940), in the study cited earlier (page 
61), believed he found such a prototype. In his experiment, 
rats were conditioned to push a panel in order to turn off a 
shock coming from the cage floor. The first response to the 
shock, before panel-pushing was learned, was a tiptoe postur- 
ing or dance, which reduced the severity of the shock. With 
further training, this response diminished in favor of the 
more effective response to the panel. In a later extinction 
series, when panel-pushing was itself no longer reinforced by 
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shock-removal, the animals went back to the earlier mincing 
steps which had been their partially satisfying mode of coping 
with the noxious stimulus. Such findings as these have been 
reported by Hull (1934) and a number of other investigators 
(e.g., Masserman, 1946); and we shall have occasion to return 
to this matter later in connection with the problem of 'chain- 
ing* (Chapter 7.). 

One-Trial Extinction 

Extinction, as we have been describing it, is a pretty slow 
process, but it may have occurred to you that many of our 
own responses show a rapid decrement when reinforcement 
ceases that you, for example, would very soon stop making 
a response that no longer got results. With the exception of 
the feeble-minded (Mitrano, 1939) or the very young, no 
human being is likely to waste many coins on a vending 
machine that is out of order; and few of us will blindly persist 
in turning a key that no longer opens a door, or scratching 
matches that never light. 

It is undoubtedly true that extinction of an operant may 
be very rapid, even complete in a single emission (how many 
of us put two nickels in a coin box when the first is not re- 
turned?), but this statement is deceptive because it ignores 
the fact that immediate extinction is not a property of orig- 
inal behavior. We learn the signs of failure by virtue of long 
training, and these signs come to govern the number of re- 
sponses made in extinction. We discriminate, that is, between 
reinforcing and non-reinforcing situations (see Chapter 5). 

An analogue of such behavior can be found in lower 
organisms. At the beginning of an experimental hour, a rat 
is given a set number of regular reinforcements, say ten or 
twenty, for bar-pressing and then extinction is carried out for 
the rest of the hour. The same procedure is followed on the 
next day, and the next, for as long as we wish. The number of 
responses made in extinction on successive days decreases 
markedly, indicating that the very failure to obtain the rein- 
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forcement begins to act as a signal to cease responding (Bul- 
lock and Fischer, 1950). It is basically this process, which hu- 
man beings improve upon over the rat by direct verbal 
instruction, that enables a response to be dropped at once. 
To speak of it without qualification as one-trial extinction is 
incorrect. It would be equally erroneous to do so in the case of 
a conditioned Type S reaction like salivation which, in suc- 
cessive extinctions following reconditioning, may get to a 
point where a single non-reinforced elicitation is enough to 
drop the response to zero magnitude (Pavlov, 1927). 

Periodic Reconditioning 

The interspersal of conditioning and extinction that we 
have just been discussing leads us into an extremely interest- 
ing area. The regular reinforcement of response is not the 
world's rule; and some important properties of behavior 
should arise from periods of extinction that are broken into 
by occasional reinforcements. Laboratory research has, in fact, 
disclosed that this is the case; and we may now examine some 
of the results obtained by the systematic repetition of condi- 
tioning and extinction. 

One such procedure, which has proved to be a valuable 
research tool, is based upon the use of single reinforcements 
that are periodically provided throughout the experimental 
session. The schedule may be of two kinds: in one, the rein- 
forcements are separated by a fixed time interval; in the other, 
they are separated by a fixed number of unreinforced re> 
spouses. Let us take them up in order. 

Periodic Reconditioning at Fixed Intervals 

To illustrate the first procedure, we take a hungry rat and 
our representative bar-pressing response, and we choose three 
minutes as a between-reinforcement interval. The animal's 
first response to the bar is rewarded with a food-pellet, but 
none of the responses in the next three minutes is permitted 
to be effective. The very first response after the interval is 
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reinforced, and the responses in the next three minutes again 
go unreinforced. And so on, for as many cycles of reinforce- 
ment and non-reinforcement as we desire. (The interval of 
extinction responses is not precisely 'fixed,' of course, since 
the rat may not always respond at exactly the time when the 
food-pellet becomes available, but excessive delay is very rare 
because of the steady high rate of responding which usually 
develops.) If we are patient and do not cut the procedure off 
too soon, we get an interesting set of changes. The cumulative 
response curve passes through three stages. 

1. At first, the intervals of unreinforced responding are 
likely to contain little curves which bend over, and which are 
actually small extinction curves. Each reinforcement brings 
out some responses, but not many, and they dwindle during 
the interval. Figure 22-A reproduces a record obtained under 
these conditions. You will observe that the little curves sum 
up so as to give a positively accelerated trend to the over-all 
curve. 

2. When reconditioning has been continued for a while, 
the positive acceleration comes to an end, and the rates within 
the intervals fuse into a steady stream of responding. The 
temporally spaced reinforcements suffice to maintain a con- 
sistent strength of response which accounts for the straight- 
line appearance of the cumulative record. The beginning of 
such a rate is seen in the later portion of Figure 22-A, and 
throughout Figure 22-B. 

LEGEND FOR FIG. 22 

A. The first stage of P-R (periodic reconditioning) training for a 
white rat. The vertical strokes in the curve mark the reinforcements. 
Note the extinction-like shape of the cumulative response curve in the 
first few periods, and the later suggestion of a steady rate. 

B. The practically linear cumulative response record obtained as the 
second stage in P-R responding. The P-R interval was three minutes, 
and this portion of the record occurred after about 60 reinforcements. 

C. The third stage of P-R responding, showing the presence of a 
temporal discrimination. This record is from the same rat as in B, 
and was obtained after 17 days of training (one hour, or 20 reinforce- 
ments, per day) . (After Skinner, 1938.) 
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3. After many reconditioning sessions, the response curves 
during the three-minute intervals take on a scalloped appear- 
ance opposite to that in stage i. Right after a reinforcement, 
the animal delays his response for a time, but, as the seconds 
slip away, he starts up again, slowly at first, and then faster, 
until the interval is over and the next reinforcement is re- 
ceived. This shows that a temporal discrimination has been 
formed. The responses just after eating a pellet are weakened 
because they are never reinforced, whereas later responses are 
strengthened because the reinforcement is given later. The 
animal comes to "tell the time*' at which responding is more 
likely to succeed. Since, in this case, reconditioning is pe- 
riodic, some degree of temporal discrimination is unavoidable 
in the long run. The curve in Figure 22-C, obtained after 
seventeen one-hour sessions of periodic reconditioning, shows 
this clearly. You will note the essential similarity of this find- 
ing to that described in our treatment of the trace reflex in 
Chapter 2. In the course of trace-reflex formation, salivation 
to the conditioned stimulus is increasingly delayed until it 
comes just before the reinforcement. 

The close examination of curves, like that you have just 
made, is an example of honoring small details, an occupation 
that has paid rich dividends in the history of science. Let us, 
then, consider another example. (And let us, for convenience, 
adopt the practice of using "P-R" for "periodic recondition- 
ing.") Figure 23 presents a telescoped P-R curve for the same 
animal that provided the curves for Figures 22-B and 22-C. It 
gives us an over-all view of response rate during twenty-four 
experimental hours. With the details washed out, another 
trend emerges. (This can best be seen by raising your book 
in the horizontal plane and sighting along the curve from its 
near end.) The curve bends over slightly with continued 
training, as the number of unreinforced responses falls off. 
This is not unlike the drop in response frequency mentioned 
earlier in connection with "one-trial" extinction, but it is 
much less marked. Apparently the temporal discrimination 
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which controls the number of responses is, in this case, slow 
to form. One might say that it is harder to 'tell when* the next 
reinforcement is coming than to 'tell that* there will be no 
more. Even single reinforcements, properly spaced, keep the 
response strength high for a long time. 
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FIG. 23. Cumulative response curve under three-minute P-R over an 
extended period of time. Note the slow over-all negative acceleration 
that probably shows the control over responding gradually assumed by 
the temporal discrimination. Figures 22-B and 22-C are enlarged por- 
tions of this curve. (After Skinner, 1938.) 
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Response Rate and P-R Interval 

In the above illustration, a fixed interval of three minutes 
was used. You may wonder what the effect would be if the 
length of this interval were changed. The answer to this query 
has been given tentatively by Skinner (1938). Within limits, 
the greater the interval length the lower the rate of response; 
and changing the interval for an individual animal will 
change the slope of his curve. Since the mere specification of 
slope does not tell us what interval gave rise to it, we need a 
term for the slope-interval combination. Skinner has supplied 
this in his extinction ratio, which refers to the ratio of unrein- 
forced to reinforced responses emitted. This ratio may be 
obtained by taking the number of responses made in an hour 
and dividing it by the number of intervals within the hour. 
For example, suppose a rat, on P-R at three-minute intervals, 
has made 346 responses in one hour. Since there are twenty 
intervals in the hour, his extinction ratio will be 346/20, or 
17.3 responses per reinforcement. The ratio tells us how 
"'willing" the animal is to make unreinforced responses when 
the reinforcements are spaced as they are that is, what effect 
a given spacing of reinforcement has upon the strength of the 
response. 

For the present, two things only need to be noted about 
the extinction ratio, (i) Its size does not seem to vary with 
the length of the interval. The data on this point are meagre, 
but they indicate that slower rates over longer intervals give 
the same ratio as faster rates over shorter intervals. (2) Rats 
that differ with respect to ratio size, differ in the same way 
when resistance to extinction is measured. 

Relative to our second point are findings obtained by 120 
students in a General Psychology course at Columbia College 
in 1946. Each student worked with a single rat and, in one 
class experiment, a two-minute P-R was employed to deter- 
mine the extinction ratio for each animal, after which the 
bar-pressing response was extinguished. The 120 rats were 
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ranked for size of extinction ratio and for number of re- 
sponses during one hour of extinction. The coefficient of 
correlation between the two ranks was +.72, indicating that 
if a rat gives a large number of unreinforced responses during 
P-R he will also give a large number in extinction. This had 
been predicted by the class, and the results bore out the pre- 
diction, but the finding needs to be checked with a more 
rigorously controlled experiment. 

Extinction After Periodic Reconditioning 

One thing is very certain: P-R increases the resistance of a 
response to extinction. This finding comes out of experiments 
like the following. One group of hungry rats is given twenty 
regular reinforcements with food for bar-pressing, and the 
response is then extinguished. A second group also receives 
twenty reinforcements before extinction, but these reinforce- 
ments are spaced out in a three-minute P-R schedule. Suppose 
that extinction in both cases is carried out for four hours, one 
hour per day on successive days. When all the extinction re- 
sponses of the two groups are added up, the second group 
comes out far ahead in the total number made. 

This simple experiment leads to some additional com- 
parisons. As we saw earlier, extinction following regular 
reinforcement starts with a sharp initial spurt in rate, 
exceeding that which prevailed during regular reinforcement. 
Afterwards, the extinction curve is marked by wave-like 
depressions and accelerations in responding. In contrast, pe- 
riodic reconditioning gives rise to an extinction curve which 
does not noticeably exceed the previous P-R rate. If we com- 
pared the two extinctions for only a brief period after rein- 
forcement had ceased, regular reinforcement would appear 
to have generated a greater resistance to extinction than P-R. 
Such a conclusion, however, would be unjustified, since a con- 
tinuation of extinction would reveal that the responding after 
P-R goes on unabated for a considerable time, whereas that 
after regular reinforcement soon tapers off. Here, as in other 
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circumstances, it is wise to wait until all the returns are in. 

Not only is extinction after P-R more resistant, but the 

curve is also smoother than that after regular reinforcement. 
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A. Cumulative response curve of extinction immediately following 
P-R. Note the smoothness of this curve as compared with extinction 
after regular reinforcement (Fig. 16). 

B. Cumulative response curve of extinction following P-R. Compared 
with extinction after regular reinforcement, responding goes on un- 
abated for a considerable time. (After Skinner, 1938.) 

The vacillations between aggressive attacks on the bar and 
depressions in responding, which come after regular rein- 
forcement is discontinued, do not occur so markedly after 
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P-R. The greater smoothness of extinction suggests that P-R 
increases a creature's "frustration tolerance." Intrinsic to 
P-R is the recurrence of periods of non-reinforcement during 
which "frustration" is repeatedly experienced and overcome 
by continued responding. We might expect the same sort of 
result in training children. Occasional reinforcement gives 
stability to behavior, and persistence in the face of failure. 
Skinner has argued that this stability, as well as the increased 
resistance to extinction following P-R, is a significant prop- 
erty of our normal behavior, and responsible for "equanim- 
ity in a world in which the contingency of reinforcing stimuli 
is necessarily uncertain." He points out that our behavior 
"would be clumsy and inefficient if the strength of an operant 
were to oscillate from one extreme to another with the pres- 
ence or absence of its reinforcement" (The behavior of 
organisms, 1938, p. 138.) 

Fixed-Ratio Periodic Reconditioning 

In addition to withholding reinforcement for a fixed in- 
terval of time, we can make it wait upon the emission of a 
certain number of unreinforced responses that is, we can 
establish a fixed ratio of unreinforced to reinforced responses. 
This procedure brings to mind cases in which a set task must 
be carried out before we reap a reward. If we use it carefully 
within the laboratory, it should enable us to make some inter- 
esting discoveries about our own behavior. 

The experimental scheme is as follows. A rat is first trained 
under P-R at a fixed interval of, say, five minutes, until his 
response rate is stabilized. Suppose we find that his extinction 
ratio is 10:1 (ten unreinforced responses for each reinforced 
one), and we then decide to change over to fixed-ratio rein- 
forcement, making food contingent upon a given number of 
responses. Three alternatives are open to us: we can set a 
fixed ratio (i) less than, (2) equal to, or (3) greater than the 
extinction ratio. 

If a reinforcement is given for less than ten responses, it 
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will come sooner than usual within each interval that is, it 
will take him less than five minutes to get his pellet. If it 
comes sooner, there will be more frequent reinforcement, and 
this will result in an increased rate of response just as it does 
with shorter fixed-interval P-R (see page 88). The new rate, 
in turn, will bring reinforcement even more quickly, and the 
process will repeat itself until a limiting high rate of response 
is achieved. The more the fixed ratio falls short of the extinc- 
tion ratio, the faster the acceleration in rate and the earlier 
the final rate-limit will be reached. Figure 25 shows this 
schematically for four fixed ratios below our assumed extinc- 
tion ratio of 10:1. 
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FIG. 25. Hypothetical curves showing acceleration of the response 
rate under several fixed ratios smaller than the 10:1 extinction ratio 
represented by the straight-line P-R curve. (From Skinner, 1938.) 

Consider, now, the hypothetical effect of using a fixed ratio 
which is the same as the extinction ratio that is, 10:1. Pre- 
sumably, the frequency of reinforcement will continue to give 
us the same slope of curve as the five-minute P-R with which 
we started out, since the fixed ratio calls for the same number 
of unreinforced responses as would be emitted, on the aver- 
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age, in the five-minute interval. This turns out to be the case, 
at least for a while. After a time, however, there is an accelera- 
tion like that which occurs when the fixed ratio is less than 
the extinction ratio. A rat may maintain his fixed-interval 
slope for as long as three experimental hours, whereupon he 
may (sometimes suddenly) assume a new and higher rate of 
responding. What could have 'informed' him that reinforce- 
ment was contingent upon the number of responses rather 
than the interval, and that he might rather respond rapidly? 
The answer lies in the fact that the extinction ratio varies from 
time to time. Ten-to-one was an average ratio; at one time 
or another it was certain to be more than that sufficiently 
so to trip off the circular accelerative process. Such variations 
are especially common in an animal's natural habitat, and are 
accentuated by fluctuations in his level of motivation which, 
in turn, determines the size of the extinction ratio (see 
Chapter 9) . 

When the fixed ratio is greater than the extinction ratio, 
we require more than ten responses before we reinforce. This 
means that reinforcements come, on the average, less often 
than every five minutes. The rate of responding should, there- 
fore, go down. But a lower response rate at fixed ratio de- 
creases the reinforcement frequency still further. We have a 
circular interaction in which response rate continually 
decelerates and finally extinguishes. The rapidity of the ex- 
tinction will depend upon how much the fixed ratio exceeds 
the extinction ratio. Although reinforcements will be re- 
ceived every now and then, the response rate , cannot be 
sustained. Even if it were sustained, a high fixed ratio might 
involve more loss of energy than the single reinforcements 
could offset, and the animal would waste away because of this 
unfavorable balance of food intake and energy output. 

The running response in an apparatus like that of Graham 
and Gagn (page 54) also reveals the basic properties of 
fixed-ratio reinforcement. Since the runway is of fixed length, 
we can think of it in a rough way as demanding a certain 
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number of steps for traversal. The distance is short, so the 
fixed ratio of unreinforced steps is comparatively small. The*, 
decreased running time on successive trials represents an 
acceleration in rate of step-responding, that is, the rat emits 
his running steps with increasing speed up to a limit. 
Although, unlike bar-pressing, the animal moves into a new 
stimulus-situation with each running response, the parallel 
between the two procedures is striking. 

The 'Ceiling of Fixed-Ratio Responding 

In the example just considered, where the fixed ratio ex- 
ceeded the extinction ratio, we said that the response extin- 
guishes. This happens whenever the fixed ratio is suddenly 
set at a value far above the extinction ratio and the same eftect 
would be achieved with the runway apparatus if we were 
suddenly to lengthen it to, say, a thousand feet. But there is 
another procedure, of a more gradual sort, by which we can 
establish very high fixed-ratio values. Taking our rat with the 
assumed extinction ratio of 10:1, let us begin with a fixed- 
ratio schedule of 8:1. The response rate will accelerate, as 
described above. Once the rate is up, consider the situation 
that exists. What is being reinforced? The responding, of 
course, but also the high rate itself is now feeling the effect. 
Every time a reinforcement comes it impinges upon responses 
coming out at a certain rate, so that not only the response is 
kept strong, but the rate, too, is being conditioned. The 
animal "learns" that bar-pressing is rewarded when it is fast. 
This is an aspect of Type R conditioning which we now meet 
for the first time, and it may pay you to stop for a moment to 
think it through. 

With our animal now responding at a high rate, we may 
raise the fixed ratio to a new level, perhaps of fourteen re- 
sponses. Bridging the small gap from eight to fourteen is not 
hard, because the animal is already conditioned to a high 
response rate, and the exact number of responses is somewhat 
elastic. He tends to keep up a high pressing speed until the 
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reinforcement is forthcoming. After the new ratio is strongly 
practised, it is raised once more; after a time, we increase it 
again. If the increases are not too great at any one step, a ratio 
can be reached far beyond the starting point. In one experi- 
ment (see Figure 26) , it was actually possible by a process of 
gradual approach and thorough training to reach and sustain 
a ratio of 192 responses per reinforcement, without extinc- 
tion! (Skinner, 1938.) Figure 26 also reveals in each curve 
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FIG. 26. Reinforcement at several fixed ratios. The cumulative re- 
sponse curves are for one rat that was gradually worked up to the ratio 
of 192:1. The horizontal lines indicate reinforcements. (After Skinner, 
1938-) 

the acceleration of response during periods of non-reinforce- 
ment. This acceleration resembles that shown in the P-R 
curve on page 84 and has a similar origin. Actually, a tem- 
poral discrimination, although of no "use" to the rat in fixed- 
ratio P-R, is nevertheless formed, since the experimental 
arrangement never permits one reinforcement to follow im- 
mediately after another. Or, think of it this way: receipt of a 
pellet is never followed immediately by another reinforce- 
ment, so the animal holds off further responding for a while. 
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But, when responding is resumed, the speed picks up again 
because reinforcement is less delayed by a quick completion 
of the fixed ratio than a slow one. In the vernacular, we would 
say that "the rat becomes more eager for the food pellet as he 
gets closer to winning it." 

This phenomenon is not alien to our everyday experience. 
Faced with a monotonous, routine job of definite length, like 
mowing a lawn, we are apt to hasten our work as we approach 
its end at which there are the waiting rewards of a cool drink 
and soft chair. Indeed, it is so natural a thing that we can 
spontaneously recognize its operation in other persons. Imag- 
ine a man who, after a long enforced separation, is returning 
to his family and home. As his objective comes into view, his 
pace quickens until, unable to hold back any longer, he 
breaks into a run. Or, imagine a couple who, under the stress 
of strong emotion, approach each other from a distance, at 
first slowly, but with gathering speed, until the last few steps 
before their embrace are covered at a run. Sensitive cinema 
directors have used such wordless scenes with great effect, 
knowing that they could count upon audiences' understand- 
ing and sympathy. 

The method of extending the fixed ratio by delaying rein- 
forcement and gradually advancing the size of the ratio, has 
been employed with rats in other than bar-pressing situations. 
A caged rat may be taught to pull a string to which a piece of 
food, lying outside the cage, is attached (e.g., Tolman, 1937). 
As sureness and speed develop in the short pulls, the string 
may be lengthened by slow degrees. Eventually, the string 
may reach a length well beyond that which the animal could 
initially have been conditioned to pull, but he hauls away 
heartily and persistently on each occasion. Or, a rat may be 
taught to dig sand from a tube through which he has pre- 
viously run unimpeded to obtain food (Stone, 1937). Once 
the response is well established, more and more sand may be 
let into the tube. If the increase is not too precipitous, a truly 
enormous amount of sand may be removed by the rat be- 



EXTINCTION AND RECONDITIONING 97 

tween reinforcements. The extension of the fixed ratio by a 
judicious manipulation of reinforcement bears a resemblance 
to the type of response training called "differentiation" (see 
Chapter 6). 

Extinction Following Fixed-Ratio Reinforcement 

Fixed-ratio reinforcement leads to extinction curves which 
are different from those following either regular or periodic 
reinforcement (fixed interval). The curve begins steeply, and 
maintains a maximal rate until the responses, quite abruptly, 
come to an end. Figure 27 shows this for one rat that had been 
working at a high ratio, and for one at a low ratio. The reason 
for the difference between these and the more usual extinc- 
tion curves may not at first be obvious. You will remember, 
however, that fixed-ratio reinforcement strengthens not only 
the individual response, but also the rate of response. When a 
reinforcement comes, the rate is usually maximal. A high 
rate of responding is usually followed by a reinforcement and, 
adopting a term from the following chapter, we can say that 
the animal "discriminates" his own rate of responding. In his 
world, the occurrence of fast responding marks the time when 
reinforcement can be 'expected/ hence it sets the occasion for 
more responding. When extinction has begun and he reaches 
the end of his fixed ratio, no pellet is discharged, yet the situa- 
tion is the normal one for reinforcement. Consequently, he 
keeps on responding rapidly because, in his history, reinforce- 
ment has come on such an occasion. Each new response makes 
the occasion still more appropriate for further responding, 
with the result that the responses continue to come out at a 
sustained maximal rate. This cannot go on indefinitely, be- 
cause past training has given the response only a limited 
strength with which to resist extinction. When the break 
comes, it comes suddenly. A tempting analogy is that which 
likens the response tendency to a reservoir that is draining off 
at full flush under high constant pressure right up to the point 
of total depletion. 
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In contrast with this process, extinction following fixed- 
interval P-R or regular reinforcement shows little influence 
of a special "discriminative value'* attached to the rate of re- 
sponse. The effect of rein- 
forcement is primarily to 
strengthen the individual 
response, irrespective of 

isoo [- r rate, while non-reinforce- 

ment decreases that 
strength. In fixed-ratio 
reinforcement, each non- 
reinforcement adds to the 
'occasion' for more re- 
sponding and hastens the 
appearance of the next 
response. These different 
consequences of non-rein- 
forcement arise, of course, 
from the opposed train- 
ing procedures. Since re- 

500 1 / inforcement is connected 

with different features of 
responding, we can ex- 
pect different extinction 
characteristics. 

The Effect of Aperiodic 
Reinforcement 

Outside of the labora- 
tory, regular reinforce- 
ment is by no means the 
rule, but neither is strictly 

periodic reinforcement. It is hardly to be expected that a 
schedule of any fixed interval or any fixed number of re- 
sponses would be scrupulously honored by an environment 
so crowded with different events. We may well ask, then, 
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FIG. 27. Cumulative response ex- 
tinction curves of two rats after fixed- 
ratio P-R. (After Skinner, 1938.) 
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whether the results of aperiodic reinforcement are the same 
as those of periodic or regular reinforcement. 

Following Skinner's (1933) early studies of P-R with rats, 
a number of other scientists tried out modified forms of inter- 
mittent reinforcement with other organisms and other re- 
sponses. Brogden (19393), using dogs as subjects, conditioned 
salivation, or foot-withdrawal from shock, with regular rein- 
forcement. Subsequently, the animals were given training 
with aperiodic reinforcement. He found that response 
strength was maintained at a high level even when no more 
than 40 per cent of the responses were reinforced. Apparently 
the effect of the aperiodic procedure was strong enough to 
offset that of the relatively small number of reinforcements 
provided. 

Humphreys' (1939) experiments in this field are of especial 
interest because he used human subjects. He studied the con- 
ditioning of an eye-wink response and, later, the galvanic 
skin response (change in electrical resistance of the skin). In 
the first experiment, a light of short duration was followed, 
after 400 milliseconds, by a puff of air to the eyeball. Condi- 
tioning was assumed to be present when the eye-closure, 
originally elicited by the air-puff, occurred regularly in re- 
sponse to the light and in advance of the puff. One group of 
subjects was regularly reinforced, the air-puff following each 
presentation of the light on ninety-six occasions. Another 
group, given "partial reinforcement," received the air-puff 
after the light only 50 per cent of the time that is, on forty- 
eight of the ninety-six occasions. In spite of this difference in 
the number of reinforcements, the conditioning was set up as 
readily in one group as in the other. 

In conditioning the galvanic skin response, Humphreys 
paired a tone with a mild electric shock (the unconditioned 
stimulus for the change in skin resistance) under two rein- 
forcement schedules. Members of a regular (100 per cent) 
reinforcement group were given sixteen combinations of tone 
and shock; and members of an aperiodic (50 per cent) rein- 
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forcement group were given only eight combinations, ran- 
domly interspersed with eight presentations of tone alone. 
Test trials, with tone alone, followed each training period. 
Again, the results showed the two schedules to be about 
equally effective in maintaining response strength. 

Extinction trials were given at the conclusion of each of 
these experiments and, in both cases, a greater resistance to 
extinction was shown by the aperiodically reinforced re- 
sponses. The effect was more striking with the conditioned 
eye-wink, but was clearly present with the galvanic skin re- 
sponsea finding confirmed in a later study (Humphreys, 
1940). This is probably what you would expect, since ape- 
riodic would seem to resemble periodic more than regular 
reinforcement, but these experiments do not entirely satisfy 
our curiosity. One of them dealt exclusively with respondent 
conditioning, and the other involved operant-respondent 
overlap. Neither was based upon the emission of straight- 
forward operant responses such as make up the bulk of our 
everyday behavior. 

When we turn to the relatively unrestricted or 'free* situa- 
tion provided in the bar-pressing type of experiment, we find 
only a few published studies to date (Humphreys, 1943; 
Mowrer and Jones, 1945; Jenkins and Clayton, 1949; Skinner, 
1950). The effect of such a reinforcement schedule can be 
described, however, with some assurance. If bar-pressing is 
reinforced at intervals of four, three, two, and one minutes, 
together with some reinforcements that come immediately 
after a reinforcement (a 'zero interval'), with these intervals 
appearing in random order, a very steady rate of responding 
will quickly develop. Even after many days of training, the 
straight-line character of cumulative-response curves will be 
maintained, since no time discrimination is present to give 
the records a scalloped effect like that observed in fixed- 
interval or fixed-ratio P-R. Also, a great resistance to extinc- 
tion will be built up by such a procedure possibly a greater 
resistance than that which results from P-R at fixed intervals 
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or fixed ratio. The extinction curve itself will show little, if 
any, initial acceleration, and its course will be marked by 
only minor fluctuations in rate. Under certain conditions, as 
when the organism has been accustomed to relatively long 
intervals of non-reinforcement in his training sessions, the 
rate during a large part of the extinction curve will be indis- 
tinguishable from the rate that preceded extinction. 

Casual observation suggests, of course, that the operant 
behavior of human beings in their daily affairs is greatly 
affected by aperiodic reinforcement. In very few spheres of 
human activity is reinforcement either regular or strictly 
periodic, and, in certain cases, the effect of this aperiodicity is 
dramatically impressive. The chronic gambler, whose infre- 
quent winnings do not keep him from trying his luck again; 
the "punch-drunk" pugilist who stumbles into the ring long 
after his fistic power has waned; even the oft-disappointed 
farmer who tills his land once more in the hope of a bumper 
crop do not these all suggest the strengthening effect of occa- 
sional, unpredictably spaced rewards? 

With the facts of P-R in hand, you should be able to make 
some critical deductions about educational procedures which 
strive to control behavior. You should, for example, see how 
one would go about teaching a child to be persistent in the 
face of failure. One would make sure, in training for skill, for 
confidence at work, or for willingness to persist in social 
activities, that the child is guaranteed some measure of success 
and approval regularly at first, but later only occasionally, 
so that he will not give up in the face of setbacks. In the case 
of emotion, where Type S reactions play an important role, 
you should be wary of using fear stimuli, like talk of bogey- 
men, to make the child obedient. Once conditioned, fears are 
difficult to extinguish; and great care must be taken there- 
after to avoid even accidental reconditioning by stimuli which 
would have gotten little notice prior to conditioning. Also, 
if some unwanted behavior in a child is to be extinguished, 
even one surrender by the parent, in a moment of fatigue or 
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embarrassment, might renew the behavior strength more than 
it would have if extinction had not been undertaken in the 
first place. To one who would break a habit, we might para- 
phrase William James and say: Never suffer reinforcements 
to occur during the extinction process, else you may inordi- 
nately strengthen the very response it was your aim to weaken 1 

Superstition: An Experimental Example 

Superstitions may be considered as beliefs or practices 
based upon assumed "if . . . then'* relations which are either 
false or undemonstrable. // a black cat crosses my path, then 
I shall have a bad day; if I take a turn around the table, my 
luck at cards will change; if there is a solar eclipse, disaster 
will befall the tribe; rain on this day means rain each day for 
the next forty; a newborn infant must not be praised aloud, 
lest spirits overhear and harm him through jealousy. Cause- 
effect connections, or sequences of events, are thought to 
operate which have no basis in fact. In groping to understand 
the troubles and joys of life, men often invent explanatory 
forces or causes which they hope can then be dealt with by 
placation or counter-measures. Superstitions are frequently 
based on anxiety about what a dimly-grasped natural environ- 
ment holds in store for us, and it is this emotional quality 
which secures their unreasoning acceptance and unshakable 
grip. Some superstitions are widely held, a social heritage of 
beliefs taught us by our elders, but others are private convic- 
tions arising, we think, from valid personal experiences. 

One class of superstitions is that in which the supposed 
cause-effect relation involves some act of our own as the agent 
in producing a given environmental effect. This effect may be 
the procurement of a positive reinforcer or the averting of a 
negative. If I wish down a well, my wish will come true; if I 
carry a rabbit's foot, no harm can befall me; if I pierce the 
manikin's heart, my enemy will die; if I perform this ritual 
in the morning, I insure against failure during the day. Such 
instances spring from the temporal contiguity of a response 
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and a reinforcement in the individual's own history or in his 
observation of others. As you are well aware, a reinforcement 
will strengthen any act which precedes it, and this is true 
even when the contingency is accidental. The correlation 
may be 'false/ but this will not side-track the conditioning. 
So far as the organism 'knows,' his reaction was indeed the 
effecting agent. With strengthening, the response appears 
more frequently, and the probability of another coincidental 
reinforcement increases. Thereafter, only occasional contin- 
gencies are needed to keep up the 'magical' response at some 
strength. 

This situation has been duplicated experimentally in a 
simple manner (Skinner, iQ48a). The method of establishing 
a 'ritualistic superstition' is essentially that of periodic rein- 
forcement, except that the reinforcement is not made to 
depend upon the emission of any arbitrarily selected operant. 
Into a hungry pigeon's cage a small portion of food is pre- 
sented at fifteen-second intervals. The presentation is gov- 
erned automatically by a clock and, once the mechanism has 
been started, requires no attention from the experimenter, 
who may even retire from the scene, leaving the animal to 
its own devices. An hour or so later, if he returns to observe 
the result, he will find some interesting behavior. The bird 
may be walking systematically in circles; he may be tossing 
his head repeatedly, or swinging it from left to right in a 
pendular motion; he may be making regular brushing move- 
ments with his beak in the direction of the floor, or poking it 
into the upper corners of the cage. He has been caught, as it 
were, by the accident of reinforcement, supplied by an un- 
witting environment, at a chance-point in his on-going be- 
havior. He has acquired a 'personal superstition.' The response 
is conditioned faster with shorter between-reinforcement in- 
tervals, when the chance of extinction and the emission of 
alternative responses is lessened. Once the response is estab- 
lished, however, this interval may be lengthened to, say, one 
or two minutes without changing the behavior. Moreover, a 
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great resistance to extinction may develop. In one animal, 
more than 10,000 emissions of the response (a side-to-side 
hop) occurred before a fifteen-minute period of non-respond- 
ing appeared. 

Punishment and Extinction 

Almost any discussion of the control of behavior eventually 
runs head-on into the question of the part played by "punish- 
ment." At various times and in various places, men have based 
their ideas of formal education, discipline, and social training 
upon the premise that punishment affects behavior in a man- 
ner the opposite to that of reward. In other words, it has been 
supposed by many to "stamp out" behavior just as reward 
"stamps it in." On the other hand, throughout the years, iso- 
lated observers and thinkers have been skeptical about the 
long-term effectiveness of this kind of behavioral control. As 
a rule, these men did not deny that punishment influenced 
behavior, but they questioned the permanence, as well as the 
desirability, of such influence. 

Sigmund Freud, the psychoanalyst, was one whose clinical 
experience led him to the latter view. He believed that one's 
early development is of great importance in shaping his adult 
personality; and he argued that, during this development and 
later, many desires and acts (mainly sexual) must be kept 
from overt expression if one is to become a socially acceptable 
human being. Overt expression is prevented by punishment 
or the threat of punishment, but the wishes and tendencies 
toward the forbidden behavior are not thereby done away 
with. They remain alive and forceful, although one may be 
unaware of their existence within him. Freud described this 
banishment of acts from overt expression as "repression" to 
the "subconscious." 

Thorndike, on the basis of human learning experiments, 
also came to doubt the efficacy of punishment as a means of 
permanently weakening behavior. In 1931, he modified his 
famous 'law of effect* so as to exclude the idea that annoyers 
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were the opposite of satisfiers in their effect upon response, 
asserting that "Annoyers do not act on learning, in general, 
by weakening whatever connection they follow." Again, in 
1932, he stated that "a satisfying after-effect which belongs to 
a connection can be relied on to strengthen the connection . . . 
but there is no evidence that an annoyer takes away strength 
... in any way comparable to the way in which a satisfying af- 
ter-effect adds strength." 

Neither of these men can be said to have solved the prob- 
lem of punishment to the satisfaction of the laboratory scien- 
tist. Thorndike's experiments were concerned almost entirely 
with the effect of a spoken right or wrong upon the strength 
of verbal connections such as those formed when one mem- 
orizes a series of word-number associations; and Freud's view 
was part of an elaborate conceptual system which laid no 
claim to experimental support either in its origins or its im- 
plications. The net result of their teachings, as you might 
guess, was not the blind acceptance of doctrine, but a quick- 
ening and a broadening of laboratory research; and it is out 
of such research that the true status of punishment may be 
expected to emerge. 

We can best approach this matter here by referring back to 
our discussion of positive and negative reinforcement. It was 
pointed out, in Chapter 3, that certain stimuli, such as loud 
sounds or electric shocks, belong within a class of negative 
reinforcers: when presented, they weaken operant behavior; 
when removed, they strengthen it; and they also possess elicit- 
ing power. Let us examine, more closely than before, the 
weakening effect of such stimuli. 

Negative reinforcement can be made contingent upon a 
response that is, the response can bring it on. This is an im- 
portant type of "punishment," as the word is commonly used. 
The contingency may be provided by nature (a child's finger 
is always burned when thrust into a flame) or through the 
behavior of another organism (the hand may be slapped when 
the finger is placed in the mouth). In either instance, the child 
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is "punished," but psychologists have, naturally, been more 
interested in the social than the non-social case. 

Punishment may be applied to an operant under two gen- 
eral conditions: while the response is (a) undergoing positive 
reinforcement (regular or otherwise) to see if it will nullify 
the effect of positive reinforcement; and (b) when it is being 
extinguished, to see if it will subtract from the strength al- 
ready possessed by the response. Both cases are interesting, but 
the latter has more practical and theoretical significance, since 
it is closely related to the age-old question of the permanency 
of punishment's effects. We begin, therefore, by asking 
whether punishment, applied to an operant during extinc- 
tion, will have a short- or a long-term influence upon the 
strength of the response. 

This problem was first attacked, in an exploratory way, by 
Skinner (1938), in connection with his studies of operant be- 
havior in the white rat. Two groups of animals were given 
periodic reconditioning for three days. Extinction was then 
carried out in two periods, of two hours each. With one 
group, all responses made in the first ten minutes of the first 
period were punished; with the other group, extinction took 
place in the usual way, no responses being punished. The 
punishment for the first group consisted of a slap adminis- 
tered by the bar itself whenever it was depressed. This slap 
was given to the animals' forepaws by a sharp return kick of 
the bar provided when an electric hammer struck against the 
bar shafts outside the response chamber. Figure 28 shows the 
average extinction curves for the two groups. The remarkable 
thing in these curves is that the effect of the slaps in suppress- 
ing the response was only temporary; the two groups eventu- 
ally came out even in the total number of responses emitted 
during extinction. Apparently punishment did not subtract 
from the over-all strength of the response, and Skinner con- 
cluded that the experiment gave "no evidence whatsoever for 
a process of negative conditioning directly the opposite of 
positive conditioning." 
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FIG. 28. The effect of punishment on extinction. Experimenter's 
comment: "The two curves are from groups of four rats each, with the 
same experimental history. All responses made by one group during the 
first ten minutes of extinction were slapped. The rate is depressed for 
some time but eventually complete recovery is made." (From Skinner, 
1938.) 

That more remained to be learned about the effects of pun- 
ishment was indicated by a second experiment in which four 
rats, after P-R training, were given five extinction periods of 
one hour each. During the first two days and forty minutes of 
the third, no responses were slapped. For the last twenty min- 
utes of the third day, and the entire hour of the fourth, all 
responses were slapped. On the fifth day, unpunished extinc- 
tion was resumed. The average curve for the four animals is 
given in Figure 29. It is plain that there was no recovery from 
the punishment during the fifth day. It appears that a pro- 
longed period of slapping had a more lasting effect than the 
short one employed in the first experiment. We cannot say, 
however, that the effect was permanent, since no further ex- 
tinction tests were given. 

A recent extended and significant research dealing with 
the punishment of operant behavior is that of Estes (1944). 
This investigator, taking off from Skinner's observations, con- 
cerned himself primarily with the effect of electric shock upon 
bar-pressing rate. In a long series of experiments, he showed 
(i) that, after two or three one-hour sessions of fixed-interval 
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P-R, a short period of mild punishment during the early part 
of extinction will depress the response rate temporarily, but 
will not decrease the number of responses required for sub- 
sequent complete extinction; (2) that a long period of mild 
punishment or a period of severe punishment (either short 
or long) will produce an appreciable decrease in the number 
of later responses, but will not decrease the time required for 
complete extinction; and (3) that, when a greater amount of 
P-R precedes the extinction period during which punishment 
is given for each response, there will be considerably less ef- 
fect upon the number of responses finally emitted. 
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FIG. 29. The effect of extended punishment on the average extinc- 
tion curve of four rats, the extinction being carried out after P-R at 
fixed intervals. (After Skinner, 1938.) 

These results tell us that the effect of punishing the bar- 
pressing response is not permanent, at least in terms of the 
time required for extinction when punishment has been 
stopped; and that, if the response is strongly conditioned in 
advance, it may not even reduce the number to be emitted 
later. But Estes* next finding is still more dramatic. (4) The 
effect of electric shock was no greater when bar-pressing 
brought it on than when it was delivered independently of 
the response. When shock was given only at times when the 
animal was not pressing the bar, there was the same immedi- 
ate depression of rate and the same slow recovery as when the 
response itself was punished. In a word, the effect was non- 
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specific. To Estes, this suggested that the important relation- 
ship was not between the punishment and the response, but 
between the punishment and the general situation in which 
it occurred. 

According to this analysis, shock is an unconditioned stim- 
ulus for a "changed state of the organism,'* involving emo- 
tional respondent changes and accompanied by a decreased 
frequency of bar-pressing. (See the following scheme.) 

General depression of 

S (shock) -^ opera nt rate and 

"emotional" responses 

^^ 

S (general- 
situation) 

Since the shock is delivered in a "general situation" the rat's 
cage environmentthis situation becomes, for all practical 
purposes, a conditioned stimulus for the behavioral changes 
evoked by the shock itself. Moreover, if this analysis is cor- 
rect, it would follow that the mere presentation of the 'con- 
ditioned situation/ unaccompanied by shock, would lead to 
extinction of the effect. 

In support of this hypothesis, Estes found that when bar- 
pressing was punished in the usual way during a short period 
of extinction, and this was followed by a two-hour period in 
which the rat was left in the response chamber, with no bar 
present and no further punishment, the effect of the shock 
was almost entirely absent when the bar was re-introduced 
and extinction was resumed. Leaving the rat in the chamber 
without the bar led to a nearly complete dissipation of the 
emotional upset caused by the shock. Except for a small de- 
pression at the start, the final extinction curve was in no dis- 
cernible way different from that provided by animals that 
had never been shocked. 

The just-mentioned small remainder of shock-effect was 
not ignored by Estes, who explained it in the following way. 
Punishment had been given to the bar-pressing response; and 
its effect had presumably been conditioned to all aspects of 
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the stimulus situation which were present when the response 
was made. Extinction of the effect had occurred in the pres- 
ence of most of these conditioned stimuli, but not all. The 
bar itself was absent during this extinction, and so was the 
response of pressing. Hence, the stimuli provided by the bar 
and the pressing movements themselves had not had a chance 
to lose their conditioned-stimulus status. Consequently, when 
the bar was returned to the chamber for the final extinction 
of the operant, the bar-stimuli and the (proprioceptive) press- 
ing-stimuli were still able to exercise a small but measurable 
influence upon the response rate. 

Estes' "changed state of the organism" was, as he said, "of 
the sort commonly called 'emotional,' " and we shall have 
more to say about this effect of punishment later, when we 
attack the general problem of emotion (Chapter 10). Already, 
however, we can see some interesting parallels of his rats' be- 
havior, in the field of everyday human conduct. Consider, for 
example, the not-uncommon experience of bringing down 
upon ourselves, through something we have said, the strong 
disapproval of others. (Disapproval itself may be negatively 
reinforcing Chapter 8.) The ill-chosen words have been 
uttered and the damage is done; we are covered with embar- 
rassment and confusion at the outcome. We are 'emotionally 
upset* and we seek escape from our predicament (as the rat 
often tries to climb out of his chamber) by one means or 
another. If our attempts fail, we remain to face the music, 
but for some time after the faux pas, and even when disap- 
proval is no longer in evidence, our behavior may be seriously 
depressed (just as the rat needs time to recover from his 
shock). If we do escape from the embarrassing situation, we 
need only to enter it again in order to re-experience the upset 
that it caused; indeed, the mere reinstatement of the fateful 
words, at some other time or in some other circumstances, 
may rearouse the old emotion in what seems to be full force. 
Repeated exposure to the situation or utterance of the pre- 
viously 'punished' words will, in time, cause the disturbance 
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to vanish just as, in the rat's case, repeated returns to the re- 
sponse chamber and further pressings of the bar will lead to 
an extinction of the effect and a recovery of operant strength. 

The major conclusion to be drawn from these studies is, of 
course, that the effects of punishment are likely to be im- 
permanent. When the emotional disturbance resulting there- 
from has disappeared, the punished act may still need a long 
period of extinction before it is eradicated from the organ- 
ism's repertory. If this is true, and if it applies to men as well 
as rats, we cannot but wonder why the use of punishment has 
been so widespread throughout the ages: why this truth re- 
ceived so little recognition. A moment's thought, however, 
will suggest two answers. First, if one disregards the restrictive 
and biologically uneconomical effects of punishment, he may 
use it in depressing "wrong" behavior and thereby pave the 
way for the strengthening of "right" behavior. This is a tech- 
nique still commonly met with in educational practice and 
often supported by experimental studies in human and ani- 
mal learning (Bunch, 1928, 1935; Dodson, 1932; Warden, 
1927; Gilbert, 1936, 1937; and others). Various researchers 
have shown that a combination of reward and mild punish- 
ment will reduce the time or the errors involved in the solu- 
tion of problems. 

The second reason is not often mentioned: the use of pun- 
ishment is positively reinforcing to the user. There is no gain- 
saying that punishment has its advantages in the control of 
others. Given the requisite physical strength, or the symbol 
thereof, we can always force others into a state of submission 
at least temporarily. Whatever annoyance they have pro- 
vided for us is thereby eliminated, and we are positively re- 
inforced. On the basis of this fact alone, it is easy to see why 
we live in a world where punishment or threat of punishment 
is the rule; where power or the signs of power are considered 
all-important in achieving social control. 
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Concluding Remarks 

Two new principles, those of operant and respondent ex- 
tinction, have been introduced in the present chapter, and 
we have shown how they operate in conjunction with op- 
erant and respondent conditioning to build up reflexes of 
exceptional strength; extinction, as well as conditioning, is 
involved in the procedures of periodic and aperiodic recon- 
ditioning. We have also presented evidence to show that the 
weakening effect of negative reinforcement (punishment), 
when added to that of non-reinforcement, is probably tem- 
porary. By now you should begin to appreciate the kind 
of analysis of behavior that we are attempting. Our aim, 
throughout this text, is to show how the complex may be ex- 
plained in terms of the simple. This will be even more ap- 
parent as we move along. Not many new principles will be 
added in later chapters; but these will be required for your 
understanding of a great many aspects of human nature 
which, for the average person, are often shrouded in darkness. 

NOTES 

Related to the effect of different amounts of reinforcement upon 
extinction (page 73) is the effect, reported by Jenkins and Clayton 
(1949), of different amounts upon the rate of response when aperiod- 
ically reinforced. Pigeons, trained to peck at a response key for food, 
were given either two or five seconds of eating time at irregularly spaced 
intervals. The mean number of pecking responses in half-hour sessions 
was found to be 1205 for the two-second amount and 1557 for the five- 
second amount. Unpublished observations of white rats in the Columbia 
laboratory indicate that a comparable effect may be obtained under 
aperiodic reinforcement with one versus two pellets of food. 

In discussing the relation between response strength and the amount 
of motivation present during conditioning, we mentioned only Strass- 
burger's (1950) study, which we felt to be the most conclusive treatment 
of this matter to date. We neglected, perhaps unfairly, to describe 
(i) the initial attack upon this problem by Finan (1940), who also used 
the bar-pressing technique; (2) a later study by MacDuff (1946), who 
used the maze; and (3) a still later study by Reynolds (1949) with the 
runway method. Finan found that conditioning under twelve hours of 
food deprivation provided the greatest response strength in his rats, 
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with twenty-four-, forty-eight-, and one-hour deprivations decreasingly 
effective. MacDuff's order of effectiveness, from most to least, was forty- 
eight-, twenty-four-, and twelve-hour deprivations. Reynolds, who has 
certain objections to both the Finan and MacDuff studies, concludes, 
from his own data and his predecessors', that we cannot yet say whether 
response strength is, or is not, a function of motivational strength at the 
time of conditioning. 

Characteristics of unconditioned or operant-level bar-pressing (see 
page 76) by rats when motivated by hunger have been described by 
Hefferline (1950), and have also been investigated by Schoenfcld, 
Antonitis, and Bersh (1950^ who obtained measures of such activity 
under both hunger and thirst. 

In the text, we said that forgetting is caused by what happens during 
the passage of time to interfere with, block, or extinguish the material- 
to-be-recalled. A more detailed statement, and some relevant data, may 
be found in Guthrie (1935) whose theoretical viewpoint is nevertheless 
different from the one we have adopted. 

In our discussion of aperiodic reconditioning, we cited experiments 
by Brogden (19393) and Humphreys (1939) in which avoidance responses 
were conditioned. Regular reinforcement was, in both cases, presumably 
compared with aperiodic reinforcement in its effects. This is, as you 
may have recognized, an over-simplification of what took place. An- 
ticipating our later treatment of avoidance behavior (Chapter 9), we 
may say that neither the shock nor the puff of air constituted the rein- 
forcement in these experiments. Rather, the leg-flexion and the eye-wink 
were strengthened by die removal of stimuli which had been regularly 
or irregularly associated with the shock or the air-puff. The periodicity 
or aperiodicity involved in such studies is of a different sort from that 
which is involved when bar-pressing is reinforced with food. 

Many investigations, not mentioned in this chapter, have been carried 
out for die purpose of determining whether punishment is an aid or a 
hindrance in the mastery of complex learning problems. Electric 
shock has been the favorite punishing agent, and, for obvious reasons, 
animals have been the principal subjects. Mazes and discrimination 
boxes (see page 137, Chapter 5) have been the most popular forms of 
apparatus; and the shock has commonly been applied in connection 
with an animal's choice-point behavior for example, a rat may be 
punished before, after, or at the moment of making a right or a wrong 
turn in a visual-discrimination test (Muenzinger, et al., 1934, 1935* 1936, 
1938). With respect to the solution of such problems, it now appears 
that there is no single answer to the question of how punishment affects 
learning. This will be quite understandable when you see that learning, 
in these situations, involves much more than mere conditioning, either 
operant or respondent. In addition, and still more important, is the 
fact that such a stimulus as electric shock may have any one of several 
functions. It may depress the response that it follows, or its removal may 
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strengthenas you already know. But it may also be informative, acting 
as a discriminative stimulus for a specific response (see Chapter 5). 
Under certain conditions, it may even be positively reinforcing when 
applied (Chapter 8). Any or all of these functions may be exercised in 
the maze or the discrimination box, depending upon the degree of 
shock employed and the kind of time relationship with stimuli and 
responses that may be provided. This has been made especially clear in 
the studies conducted by Muenzinger and his co-workers. 

The strongest opposition to the use of punishment in education 
comes from Thorndike's many studies of human learning. Thorndike 
argued that the effect of Wrong or other mild 'annoyers* is often to 
strengthen, rather than weaken, the responses they follow. Admitting 
that intense punishment might sometimes have a suppressive effect upon 
behavior, he pointed out that it was always a dangerous weapon, being 
most effective with sensitive persons, who need it least. At best, Thorn- 
dike argued, punishment is successfully applied only when it leads one 
"to shift to the right behavior and enjoy it, or to have such expectations 
from the wrong behavior that he is more comfortable to avoid it than 
to enter upon it." (The full import of the last part of this quotation 
will be clearer when you have read the discussion of avoidance behavior 
in Chapter 9 of this text.) 
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INSTANCES of this kind are so plentiful everywhere, that if I add one 
more, it is only for the pleasant oddness of it. It is of a young gentleman, 
who, having learnt to dance, and that to great perfection, there hap- 
pened to stand an old trunk in the room where he learnt. The idea of 
this remarkable piece of household stuff had so mixed itself with the 
turns and steps of all his dances, that though in that chamber he could 
dance excellently well, yet it was only whilst the trunk was there; nor 
could he perform well in any other place, unless that or some such other 
trunk had its due position in the room. 

John Locke, An Essay Concerning Human Understanding, 1690 

Stimulus Generalization 

This chapter begins with a single fact or characteristic of 
behavior: When an organism is conditioned to respond to one 
stimulus, it will respond in the same way to certain others. 
\Ve call this generalization and, as we proceed, we shall find 
that it helps us to explain a great deal of behavior that seems, 
at first glance, to be very complicated. 

The existence of stimulus generalization was discovered 
early in the history of psychology. Before 1910, Pavlov and 
his co-workers had observed it and reported upon it in these 
words: "If a tone of 1000 d.v. is established as a conditioned 
stimulus, many other tones spontaneously acquire similar 
properties. . . . The same is observed with stimulation of other 
receptor organs. This spontaneous development ... we have 
termed . . . generalization of stimuli . . .". The fact is now so 
well established in both respondent and operant behavior 
that we may state it as a principle 
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An increase or decrease in the strength of one reflex, 
through reinforcement or extinction, is accompanied 
by a similar but smaller increase or decrease in the 
strength of other reflexes that have stimulus properties 
in common with the first. 

A little thought will convince you that this principle is of 
importance to any organism in its daily life Our environ- 
ment is in perpetual flux, and it is very unlikely that any 
stimulus ever recurs in identical form. The visual stimuli sup- 
plied by a running rabbit to a pursuing fox, or by the face 
of a friend as you see it from moment to moment, are subject 
to countless variations in pattern, movement, brightness, and 
so forth, yet the fox continues its chase, and you do not feel 
yourself confronted by a procession of strangers. In the ever 
changing environment, the generalization of stimuli gives 
stability and consistency to our behavior. 

Stimulus Discrimination 

On the other hand, the behavior of organisms would be 
equally ineffective and unadaptive if it could never get over 
the barrier of generalization. An organism must be able to 
respond differently to different objects in its environment; 
and common observation tells us that it does. The fox pursues 
the rabbit, but not the hound; and we distinguish one friend's 
face from another's. Behavior can show a specificity with re- 
spect to stimuli, and when this specificity is developed in the 
face of generalization, we speak of stimulus discrimi?iation. 

An organism manifests a discrimination when it comes to 
respond to one, but not to the other, of two previously gen- 
eralizing stimuli. To be called discriminative, the response 
to the first must be maintained while the response to the sec- 
ond is weakened. In the laboratory and in everyday life, the 
development of this difference depends upon the reinforce- 
ment, or lack of reinforcement, that attends responses. The 
basic principle may, then, be stated as follows: 
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A reflex strengthened through generalization may be sep- 
arately extinguished, so that an organism responds to 
one stimulus and fails to respond, or responds less 
strongly, to another. 

In contrast with generalization, the process of discrimination 
gives our behavior its specificity, variety, and flexibility. 

Forming a Respondent Discrimination 

"Generalization" and "discrimination" are a natural pair, 
like the opposite poles of a magnet. Pavlov's (1927) way of 
showing how discriminations are formed is embodied in the 
"method of contrasts," which combines the procedures of re- 
inforcement and extinction. A hungry dog is first conditioned 
to salivate to a looo-cycle tone (or some other) by the usual 
Type S technique. Such a conditioning, as we already know, 
will be generalized the dog will salivate, to a lesser degree, 
when other tones are presented. One stimulus, say the 1000- 
cycle tone, is then chosen as 'positive* (to be followed by 
reinforcement) and another, say the goo-cycle tone, as 'nega- 
tive' (never to be followed by reinforcement). The two 
stimuli are then presented, in haphazard order, on many oc- 
casions. The outcome is a difference in the strength of the 
two reflexes in other words, a discrimination is formed. 

An indispensable part of this procedure of selective rein- 
forcement or "contrasts" lies in the random alternation of 
the stimuli to be discriminated. Too many successive presen- 
tations of either tone alone will not overcome generalization. 
Frequent interchange of 'positive' and 'negative* stimuli is 
essential if a discriminative response is to be established. 

The formation of a discrimination is, then, a double proc- 
ess. Through generalization, each direct reinforcement of 
stimulus A adds to the eliciting power of stimulus B; each 
extinction of stimulus B subtracts a little from the power of 
A. Further reinforcements give more power to A than B; and 
further extinctions take more from B than A. As the stimuli 
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draw apart in their power, a discrimination is in the making. 
The gradual accumulation of differences in the strength of 
the two reflexes is the heart of the process. 

Generalization and Discrimination in Operant 
Conditioning: The "Discriminative Stimulus" 

We now return to our representative operant, bar-pressing. 
You saw, in Chapter 3, that this response was emitted, not 
forced; and that its strength could be increased through re- 
inforcement. You saw >o, in the case of the runway operant, 
that a connection de\ ^.oped between the running behavior 
and the opening of the starting-box door. This door-opening 
stimulation was called "discriminative," rather than "elicit- 
ing," and we are now in a position to make this distinction 
clearer. 

Suppose, in the bar-pressing situation, that we take two 
stimuli, a bright light and a dim light, and arrange an ex- 
periment in which they are turned on alternately within the 
rat's response compartment. The rat has been conditioned 
previously to press the bar for food. He is hungry again and, 
when placed in the box, begins to respond immediately. This 
time, however, the conditions are different. Although he is 
free to respond at any time, we reinforce the response only 
when the bright light is on. 

In such a case, neither stimulus elicits bar-pressing. The 
bright light merely sets the occasion on which reinforcement 
will follow if the response is made; the dim light, on the other 
hand, provides the cue for not responding and sets the occa- 
sion for non-reinforcement. Each is a discriminative stimulus. 
Adopting conventional notation, we may refer to the bright 
light, in the presence of which reinforcement occurs, as S D 
(ess-dee) and the dim light, in the presence of which there is 
no reinforcement, as S A (ess-delta). 

Let us continue the procedure. Periodically, the bright 
light (S D ) comes on and stays on until the animal presses the 
bar. At the response, food is delivered and the light goes off. 
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For five minutes, the light is dim (S A ) and no response during 
that time is reinforced. Then, S D appears again, the next 
response is reinforced, and the cycle is repeatedfor as many 
times as we desire. 

As in the case of respondents, generalization occurs. A rein- 
forcement in the presence of S D increases its effectiveness as a 
cue, but it also increases, to a lesser degree, the effectiveness 
of S A ; an unreinforced response to S A decreases its effective- 
ness and weakens slightly that of S D . Continued alternation of 
S D and S A causes them to draw apart in their evocative power. 
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FIG. 30. The formation of a light-dark discrimination by one animal 
following two sessions of P-R at fixed intervals. (After Skinner, 1938.) 

Here, again, extinction is the hallmark of discrimination- 
responding to S A extinguishes while responding to S D is main- 
tained. A cumulative response curve should therefore reveal, 
by its shape, the formation of the discrimination. That is, it 
should resemble a curve of extinction. It is because extinc- 
tion, as opposed to conditioning, is the important process in 
discrimination, that our experiment provided such relatively 
long periods of S A . 

Figure 30 presents such a curve, made by a single rat, 
during a ten-day period of experimentation in which light 
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was the S D and darkness was the S A , The discrimination, which 
appears as a relatively slow change in response rate under S A , 
followed two days of periodic reconditioning (in the dark) at 
five-minute intervals. One hour of training was given per 
day, and approximately six hours were required for the 
amount of responding in successive five-minute periods to 
reach a minimal value. Responding under S A never disappears 
entirely, and the S D responses are of course included in the 
cumulation; hence, the curve never reaches an operant level 
comparable to that of a simple extinction curve. 

The Effect of Previous Training 

Does the length of time needed to form a discrimination 
depend upon the amount or the kind of an organism's pre- 
vious training? In answering this question, our line of reason- 
ing would be as follows: A discrimination involves extinction; 
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FIG. 31. Average cumulative response curve for four rats showing 
the formation of a light-dark discrimination following 50 regular rein- 
forcements and one period of extinction. (After Skinner, 1938.) 

resistance to extinction varies with the kind and amount of 
training, hence the time required for a discrimination to 
develop will depend upon previous training. In Figure 31, we 
have an average discrimination curve for four rats. Before the 
discrimination began, these animals had received fifty regular 
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reinforcements, followed by a period of extinction. The 
strength of the response was therefore much less than in the 
case of the rat (see Figure 30) that had previously received 
two days of periodic reinforcement when the discrimination 
training was introduced. Figure 31 shows that the discrimina- 
tive process for this group was practically complete by the 
end of the second day. 

An interesting result, not so easily predicted, arises when 
we attempt to teach a discrimination without any previous 
training in bar-pressing. Here, on the first occasion when the 
bar is accessible, the S D is present. The first response is rein- 
forced, the S D is replaced by S A for a five-minute interval, and 
the cycle is then repeated. A sample curve for a single animal 
is given in Figure 32. It shows that the generalization to S A 
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FIG. 32. The formation of a discrimination without previous con- 
ditioning. The vertical strokes over the cumulative response curve indi 
cate reinforcements in the presence of die S D . (After Skinner, 



is almost nil, so that there is nothing to extinguish. We con- 
clude that, under suitable conditions, a discrimination may 
be formed immediately. The basic requirements are that 
neither reflex is strengthened in advance of the training pro- 
cedure, and that the first reinforcements occur in the presence 
ofS D . 

A Human Discrimination Analyzed 

When human rather than animal subjects are taught to 
make discriminations, it is not always easy to identify the 
changes that occur, but an experiment by Hilgard, Campbell, 
and Sears (1938) offers convincing evidence that the same 
fundamental processes are involved in each case. A visual 
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stimulus (a light in the left one of two small windows) was 
presented on sixty occasions to fourteen subjects, each pre- 
sentation being followed by a puff of air to the eyeball (the 
eliciting stimulus for the eye-wink reflex). After this training, 
when the subjects were responding to the light in advance of 
the air-puff about 75 per cent of the time, another stimulus 
(a light in the right-hand window) was introduced and ran- 
domly alternated with the first 
during sixty trials of discrim- 
ination training. The left-hand 
light was positive (always fol- 
lowed by the air-puff) and the 
right-hand light was negative 
(never followed by the air- 
puff). 

Figure 33 shows the change 
in strength of the response to 
positive and negative stimuli 
during the discrimination pro- 
cedure. You will note that the 
responses to S D and S A draw 
apart gradually in their fre- 
quency, and that the major 
change is one of extinctiona 
weakening under S A . Except 
for a slight drop in the S D curve 
at the very beginning of the 
discrimination trials (possibly 

due to a generalization from the extinction of response to S A ), 
the eye-wink occurs with the same frequency as at the end of 
the conditioning trials. 

Abolishing a Discrimination 

If we wish to abolish a discrimination, we must restore re- 
sponding to S A . Two procedures are available, (i) We may 
keep S A present continuously and reinforce every response, 
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FIG. 33. Progress in forming 
a discrimination by human sub- 
jects. The conditioned response 
employed was the eye-wink; the 
stimuli to be discriminated were 
visual. (After Hilgard, Campbell, 
and Sears, 1938.) 
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thus reconditioning the extinguished response. (2) We may 
return to periodic reconditioning at the same interval em- 
ployed in discrimination training but without the use of S D . 
This kind of abolishment gets the response back to its 
original P-R rate. 

Extinguishing a Discriminative Operant 

Abolition of a discrimination is different from extinguish- 
ing the discriminative response. When we abolish, S D and S A 
are equalized through reinforcement of both (responses under 
S A are reconditioned); when we extinguish, the equalization 
of S D and S A is accomplished by the weakening of the response 
to S D . To illustrate the latter, suppose that we have a well- 
formed discrimination, in which S D regularly evokes the re- 
sponse and S A seldom does. The response to S D is, then, our 
discriminative response. It may be extinguished, like any 
other, by withholding all reinforcement in the presence of 
S D . But what kind of an extinction curve would you look for? 
You might expect, at first thought, a great resistance of the 
response to extinction, since S D responding has been rein- 
forced, say, at five-minute intervals, and this is like the pro- 
cedure of periodic reconditioning. On second thought, you 
will see that the response has always been reinforced under 
S D the only non-reinforcement was under S A . The curve of 
extinction to S D should therefore resemble that of extinction 
after regular reinforcement which it does. 

The Meaning of "Similarity" 

British philosophers of the seventeenth and eighteenth cen- 
turies made much of two "laws" through which "ideas" were 
associated: contiguity and similarity. The influence of these 
philosophers is still apparent in our everyday speech, as when 
we say that one idea calls up another because of their likeness 
or because they were once associated in time or place, but a 
modern objective psychology does not find the concept of the 
"association of ideas" very useful. We do find, however, that 
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a contiguity of stimuli, or of stimulus and response, is essen- 
tial to Type S conditioning or operant discrimination. Can 
we give an objective meaning to "similarity" as well? 

If you will pause to consider the matter, you will see that 
"similarity" and generalization are the same thing. In every- 
day affairs, we talk as if stimuli could be similar in themselves, 
but actually their similarity depends upon our own behavior; 
they are similar when, and only when, we make the same sort 
of response to them. Similarity does not reside in stimuli 
alone, any more than in "ideas." 

Stimuli, it is true, may have common physical properties 
and, in a physical sense, are therefore "similar." But when 
people say that things are similar they mean that they tend to 
react to them in the same way. They are really reporting this 
tendency with the words They are similar. This is quite dif- 
ferent from the physical similarity which is often, though not 
necessarily, present when the responses are similar. 

Related to this point is an experiment by Plotkin (1913) on 
the learning of International Morse Code by college students. 
Before their instruction began, these students were presented 
with pairs of code signals and asked to rate them on the degree 
of their "similarity." Later, during the actual code training, 
lie found that the confusions of signals with each other (gen- 
eralizations) were directly related to their previously esti- 
mated "similarity." For example, the signals for the letters C 

(-. .) and Y ( . ), which were commonly generalized 

during training, had already been judged to be very much 

"alike"; whereas, the signals for A (. ) and O ( ), which 

had been rated as dissimilar, were never confused with each 
other. The identity of "similarity" and generalization was 
quite clearly indicated, and gives weight to Hull's (1913) 
statement that "the common-sense notion of similarity and 
difference is based upon the presence or absence of ... 
generalization." 

When two or more stimuli are found to generalize, so that 
a response conditioned to one is made to the others of the 
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group, we may of course try to identify the property that they 
have in common. It is sometimes difficult, or even impossible, 
to do this. Stimuli are known to acquire their functional 
equivalence from several sources. We may, for example, gen- 
eralize two visual stimuli on the basis of wave-length, energy, 
size, shape, or location. Also, if they are composite stimuli, 
they may generalize through the identity of their parts. Ap- 
parently, too, they may generalize on the basis of the emo- 
tional or other responses that they arouse within us (as in the 
case of "mediated" generalization, to be mentioned later). 
These and other factors play a part, singly or in combination, 
in determining the degree to which stimuli are equivalent in 
evoking response. You may well imagine that the problem of 
identifying them in any given case is one that challenges the 
best efforts of specialists in the field of discrimination. 

The concept of generalization has been extended to more 
complex problems of human behavior. Psychiatrists and clin- 
ical psychologists often encounter striking instances of re- 
spondent generalization, in which emotional upsets originally 
connected with a single event in a patient's history come to 
be evoked by stimuli commonly met in the daily routine of 
living. In one case, a phobia that prevented a woman from 
looking into the eyes of anyone, whether friend or stranger, 
was traced to a single experience. Some years before, when 
surreptitiously opening the drawer of a chest in the home of 
a blind aunt, she was confronted with a pair of staring glass 
eyes. The panic induced later by the sight of any eyeball 
stemmed from this "trivial" but "similar" incident. 

Up to this point we have emphasized the strengthening 
action of generalization. But the opposite effect may also 
occur. Experiments show that reflexes, strengthened through 
generalization, may be weakened when the originally condi- 
tioned reflex is extinguished. Also, if a number of different 
stimuli, say a tone, a touch, and a sound, are attached to the 
same response through conditioning, the extinction of any 
one of them will decrease the strength of the rest. 
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Into the details of such experiments we need not go, but it 
may be mentioned that psychiatrists, again, have apparently 
utilized this principle. A number of their curative techniques, 
as when they probe repeatedly with verbal stimuli into the 
lives of their patients, clearly provide for the extinction of 
emotional respondents. Through generalization, the emo- 
tional responses to these upsetting stimuli when encountered 
outside of the clinic are also extinguished to some degree in 
spite of obvious differences in the stimulus situation. 

Generalization Gradients 

Generalizing stimuli may be classed in terms of the sense- 
organs involved visual stimuli, for example, involve the 
function of receptors in the eye. Within each of these 'sense 
departments,' stimuli may form series, like the series of 
pitches, colors, loudnesses, brightnesses, warmths, and dis- 
tances apart on the skin surface. Now, if we inquire how, 
after any one member of a series has been made a conditioned 
stimulus, the effects of generalization spread out over the 
rest, we discover an interesting fact: there is a gradation of 
effect, depending upon the physical proximity of the stimuli 
to the one conditioned. 

Pavlov (1927) noted this early in his studies of generaliza- 
tion: "If a tone of 1000 d.v. is established as a conditioned 
stimulus, many other tones spontaneously acquire similar 
properties, such properties diminishing proportionally to the 
intervals of these tones from the one of 1000 d.v." (The 
italicizing is our own.) This gradation effect has led to the 
term generalization gradient. 

In addition to a gradient for strengthening, Pavlov also 
reported one for extinction, obtained from the following 
experiment. Along the hind leg of a dog, five spots were 
selected, the first at the paw and the others spaced three, nine, 
fifteen, and twenty-two centimeters from the first. By direct 
reinforcement, the four spots were equalized in their effec- 
tiveness as conditioned stimuli for salivation, while the paw 
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spot was extinguished. When this state was reached, the paw 
spot was given three more unreinforced stimulations, and 
the other four spots were separately tested. The generalized 
extinction was greatest for the three-centimeter spot and de- 
creased progressively to the twenty-two-centimeter spot. 
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FIG. 34. Average generalization gradient for tonal pitch obtained 
from 10 human subjects. The conditioned response was the galvanic 
skin response, labeled PGR (psychogalvanic reflex) on the ordinate; 
the US was electric shock. The response was first conditioned to a tone 
of 1967 c.p.s., and generalization was tested to tones of 1000, 468, and 153 
c.p.s. The abscissa is logarithmic in scale to keep the figure down in 
size. (Data from Hovland, 19373.) 

Both conditioning and extinction generalization gradients 
have been found for human respondents by American inves- 
tigators (Bass and Hull, 1934; Hovland, 19373). Moreover, 
gradients for discriminative stimuli have been demonstrated 
in studies of operant behavior. Frick (1948) put five groups of 
rats on a discriminative training schedule in which S D and S A 
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were alternately present for two and a half minutes each. S D 
and S A were lights of different brightness, and each group of 
animals was trained with a different pair, as shown below: 



Brightness 

(in foot-candies) 



CROUPS 
I II III IV V 

S D 20 2O 2O 2O 20 

S A \o 7.5 5 .02 "Dark" 



Frick's expectation was that die ease of forming the dis- 
crimination would depend on the difference between S D and 
S A ; and his results bore out the hypothesis. Both the rapidity 
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FIG. 35. Average cumulative curves for S^ responding by the several 
groups in Frick's experiment. The curve for S-J is linear and does not 
differ from the P-R rate before discrimination training began (i.e., the 
difference between S D and S^ was not discriminate). The separation 
of the curves is an indication of the varying degrees of generalization. 
The curves are composites of a number of experimental periods. (After 
Frick, 1948.) 
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with which responding to S A extinguished and the complete- 
ness of the extinction were increased as the difference between 
S D and S A grew larger. Similar results have been secured by 

Raben (1949), who used 
the Graham-Gagne runway 
and a technique developed 
by Verplanck (1942). 

Evidence for generaliza- 
tion gradients with human 
subjects and discriminative 
stimuli was collected, in 
1939, by Gibson. In one 
experiment, the S D was a 
vibratory stimulus, applied 
to the subject's back. After 
a verbal response (a spok- 
en nonsense syllable, duf) 
had been connected with 
an S D at one spot on the 
skin, the vibration was pre- 
sented at spots four, eight, 
and twelve inches distant 
in a straight line down the 
subject's back. Each sub- 
ject was instructed to re- 
spond with the syllable 
only when the stimulus 
was felt at the initial posi- 
tion. Generalization was 
measured in terms of the 
percentage of dut responses to the stimulus at each of the 
other positions. The average percentage values obtained for 
the four points were, respectively, ninety-eight (at the 'con- 
ditioned' spot), twenty-five, fourteen, and nine, indicating a 
steep but continuous gradient (Fig. 36). 
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FIG. 36. Average generalization 
gradient of tactual stimuli varying 
in distance from a conditioned spot 
on the skin of human subjects. 
(After Gibson, 1939.) 
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The Study of Discriminative Capacity 

A proper understanding of discrimination must develop 
along two lines: first, an analysis of the process of discrimina- 
tion and the factors influencing this process; secondly, the 
investigation of the discriminative capacities of organisms. We 
have already considered the former, and we now focus our 
attention briefly upon the latter. Again we shall deal with 
matters of general validity, not attempting to make an inven- 
tory of all the sensory capacities throughout the animal scale, 
or to catalogue the changes in capacity accompanying the 
individual development of an organism. Our emphasis will 
be upon the behavioral aspects of the problem, the experi- 
mental methods used, and a few of the more significant 
concepts. 

i. Capacity and the threshold. It is possible to alter an 
organism's environment in a way that has no effect upon its 
behavior. It is easy to pick a physical change to which an 
organism cannot be brought to respond at all, or we can 
choose two stimuli which it will not discriminate no matter 
how long we continue selective reinforcement. Thus, neither 
a human being nor a dog would ever respond to a touch on 
the skin as faint as a dust particle, nor could they ever dis- 
criminate a tone of 1000 cycles from one of 1000.01 cycles. 
The problem of capacity arises from these elementary facts. 

The measurement of sensory capacity reduces to two deter- 
minations, both of which were mentioned in Chapter i: 

a. The smallest value of a stimulus to which a response 
can be made. Such would be the softest sound, the faintest 
light, the lowest pitch, the lightest touch. In discriminative 
terms, it would be the least stimulus which can be discrim- 
inated from none at all, so that a response cannot be condi- 
tioned to a lesser value. As noted in Chapter i, these least 
values have been called "absolute thresholds" or "absolute 
limens." 

b. The least difference between two stimuli, each above 
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the absolute threshold, that can be discriminated. Two 
stimuli may be so close together that, behaviorally, they are 
identical; the difference between them must reach or exceed 
a certain value before a response to one but not the other 
may be established. An extension of the metaphor of "thresh- 
old" led naturally to the designation "difference threshold" 
or "difference limen." Both types of threshold are stated in 
terms of some physical measure of the stimulus, whether 
energy, frequency, pressure, temperature, or some other. 
Stimulus values may be thought of as "supra-liminal," just 
"liminal," or "subliminal" for both types of threshold. Never- 
theless, it should be kept in mind that the meaning of thresh- 
old lies in an organism's behavior. We define in physical 
terms those stimuli which are or are not adequate for dis- 
criminative responding. The behavioral data are always prior 
and necessary to the concept of capacity; and stimuli which 
are not discriminated even after arduous training are held to 
be below capacity level. We speak as if stimuli can or cannot 
be discriminated because of the limitations of capacity, but 
we intend that capacity be defined by the behavioral evidence 
of stimulus discriminability. 

2. The 'psychophysicaV methods. Historically, the study 
of human sensory capacities was the first to get under way in 
psychology with an adequate set of methods. By 1860, Gustav 
Fechner, an outstanding German scientist of his day, had 
formulated several experimental procedures which are still 
the backbone of human sensory research. Designed to test the 
limits of discriminative capacity, these methods aim to deter- 
mine both absolute and difference thresholds. Fechner him- 
self thought of them as a means of determining the relation 
between "mental sensations" and the "physical world" (hence 
"psychophysics"), but we see them today, not as a solution to 
a philosophical problem, but as important contributions to 
the measurement of sensory capacities. 

The psychophysical methods differ in their complexity, 
their usefulness under given circumstances, and in the type 
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of threshold they measure. Because Fechner realized that a 
subject's sensitivity varies from moment to moment, the 
methods each provide for measurements to be repeated as 
often as desired in order to secure reliable average estimates 
of threshold. They are alike, too, in presenting to a subject 
either single or paired stimuli. The subject is asked to report 
whether he can perceive the stimulus at all, or whether he can 
tell that two stimuli are different. Repeated stimulus presen- 
tations and statistical treatment of responses finally yield an 
estimate of his threshold and the consistency of his responses, 
both of which are measures of his accuracy or capacity. 

The psychophysical methods are unquestionably endorsed 
by common sense. They are characteristically human in their 
reliance upon verbal behavior. They do not ask how such 
behavior is acquired and how words come to be discrimina- 
tive responses. Nothing seems more reasonable on the surface 
than to ask a person whether he can or cannot perceive a 
stimulus or a stimulus difference. In this case, however, as in 
many others of unanalyzed human intercourse, common 
sense does not reveal the actual complexity of the experiment 
or the tacit assumptions upon which we proceed. The very 
feasibility of the methods, in which 'instructions' to the 
subject are indispensable, depends upon a long history of 
verbal behavior in both experimenter and subject which 
circumvents the need for training a new discriminative re- 
sponse such as would be necessary with lower organisms. The 
former "asks" the latter to "observe," "perceive," "pay atten- 
tion," "judge," "cooperate," and so on; and the subject's only 
reinforcement is "social approval." Truly, there is material 
for analysis here. Yet, despite their naivete, the psychophys- 
ical methods give a very fine measure of human sense acuity. 
The reliance upon verbal behavior is amply justified by the 
lawfulness and dependability of the results. 

With human or animal subjects, and with respondent or 
operant methods, threshold determinations are subject to the 
influence of many important variables. These experimental 



GENERALIZATION AND DISCRIMINATION 133 

situations include far more than the stimuli to be discrim- 
inated or the receptor organ being tested. Variables within 
the organism and in the experimental situation may count 
heavily in determining results. Hence, it is always premature 
to conclude that the absolute or difference threshold obtained 
is a final and immutable value. A failure to discriminate with 
one method does not mean that the organism would fail with 
a different method. Human subjects, for example, may im- 
prove their thresholds when given information on their per- 
formance during the experiment. Only recently, too, the use 
of a conditioned respondent has revealed for the first time 
the existence of color vision in the rabbit, when other 
methods had failed (Brown, 1936). Such instances will un- 
doubtedly occur in the future. 

3. Some examples of human acuity. Studies of human re- 
ceptor sensitivity have turned up some dramatic tacts. Our 
senses are really much more acute than we commonly imag- 
ine. Indeed, they frequently prove to be more responsive to 
small environmental energies and changes than most of the 
modern physical instruments which we regard with awe. 
Under good experimental conditions, a person with normal 
eyes can tell two colors apart when they differ by as little as 
two or three millimicrons in wave-length (a millimicron is a 
millionth of a millimeter); the brightest light which permits 
ordinary good vision may be 2.00 x io 8 times as bright as the 
absolute threshold value; and a wire 3/32 of an inch thick 
can be seen at a distance of half a mile. 

Hearing is initiated by the impact of sound waves against 
the ear-drum. It has been computed that the faintest sound 
we can hear arises from impacts which do not greatly exceed 
in force the casual collisions of air molecules against the 
drum. With a little more sensitivity, we could actually "hear" 
the molecular movement of the air. As for pitch, some sub- 
jects can, with 100 per cent certainty, discriminate two tones 
which, at the 2OO-cycle level, are separated by only one-eighth 
to one-half of a cycle. 
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The sense of smell is also extremely acute, despite populai 
misconceptions. One sniff suffices to tell the presence of a 
substance like mercaptan when the amount of material in- 
volved is much less than can be picked up with a spectroscope. 
For odorous gases injected into the nasal passages, a rise in 
pressure of a few millimeters of mercury is enough to make 
some subjects unfailingly distinguish the smell where before 
they were completely insensitive. 

In some of our sensing, we are excelled by lower animals, 
like the dog; in others, we stack up quite well against our 
biological contemporaries. This is true, however, only for 
acuities measured under rigid experimental controls. In 
everyday life, our senses function under crude conditions and 
we do not usually have occasion to fall back upon their ex- 
treme capacities. For these reasons, we do not learn our full 
potentialities until the laboratory tells us its story. 

4. Other methods. Since the capacities of infra4iuman 
organisms cannot be gauged by way of verbal responses, we 
resort to the basic procedure of forming a discrimination. 
For example, using a respondent like the salivary reflex, we 
begin with two visual stimuli and the method of contrasts. 
The "positive'* stimulus is a luminous circle, the "negative" 
an ellipse. The ellipse has a minor axis equal to the circle's 
diameter, but the major axis is twice that length. The dis- 
crimination between the two stimuli is easily formed. Then, 
by small steps, we bring the ellipse nearer and nearer to the 
shape of the circle, reinforcing regularly the response to the 
latter and never reinforcing the response to the former. At 
some point in the progression, the major-minor axis ratio of 
the ellipse will be such that the animal no longer responds to 
it any differently than to the circle that is, we come to his 
difference threshold. Pavlov (1927) found this ratio to be 
about 9:8. By a similar procedure of selective reinforcement, 
we may determine the absolute threshold (e.g., for a sound) 
by seeing how small a stimulus magnitude can be made into 
a conditioned stimulus. 
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The basic experiment for operant discrimination may also 
be used in determining thresholds. The difference limen is 
obtained by bringing S D and S A to the point where, with a 
slightly smaller difference between them, the cumulative 
response curve shows exactly the same properties as the ordi- 
nary P-R curve. That is, extinction to S A does not occur and 
the reinforcement of only S D responses sets up a periodic 
cycle. The absolute threshold is determined when we find the 
least value of S D which, when paired with an S A of zero mag- 




FIG. 37. One of Elder's subjects, earphone adjusted over left ear, 
in position for tests. The experimenter, who is behind the screen, 
raises the inclined sliding door at the ape's right, exposing the telegraph 
key. When the key is correctly pressed by the subject, his reward is 
delivered automatically through the chute at the left. (After Elder, 



nitude, still permits a discrimination or, to turn it around, 
the greatest value of S D which, with S A equal to zero, gives 
only a P-R curve. (This method, as well as the preceding one 
for respondents, tells us how the difficulty of forming a dis- 
crimination changes as the stimuli approach liminal values.) 
Variations of the operant procedure have been used in 
studying the discriminative capacities of organisms higher in 
the scale than the rat. Elder (1934), for example, determined 
the absolute threshold for tonal intensity in several chim- 
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panzees by having them press a telegraph key whenever the 
"positive" stimulus sounded in their ear-phones. A "ready 
signal" (the opening of a door which gave access to the key) 
preceded exposures to S D and S* (silence). The S D was pre- 
sented until a response occurred, after which the door to the 

reaction key was closed and 
the ape was given a bit of 
fruit; S A was presented for 
short periods only eight 
seconds of silence following 
the signal, or until the sub- 
ject responded "falsely" 
after which the door to the 
key was closed again. An 
equal number of S D and S A 
presentations were provided, 
in random order, and thresh- 
old determinations were 
based on the percentage of 
correct responses at different 
intensity levels. The thresh- 
old values for seven tones, 
ranging in frequency be- 
tween 128 and 8192 cycles 
per second, showed that, on 
the whole, the apes were 
equal in their sensitivity to 
human beings of a compar- 
able age group. 

A more complicated form 




FIG. 38. The Lashlcy jumping 
apparatus. 5 animal's stand; n net; 
f.p.lood platform. (Reprinted 
from Crafts, Schneirla, Robinson, 
and Gilbert, Recent experiments in 
psychology, as redrawn from Lash- 
ley, 1930. Copyright, 1938. Courtesy 
of McGraw-Hill Book Company.) 



of operant technique comes 
under the heading of "choice-reaction." There are several 
variations of this technique, but none is intended to give, nor 
does it give, a picture of the discriminative process. The 
stimuli to be distinguished are simultaneously presented, and 
one observes merely the all-or-none event on each trial in 
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which the response is made either to S D or S A . A growing 
preponderance of responses to S D as against S A indicates an 
emerging discrimination. The technique is adequate to show 
whether a discrimination is possible, hence to supply a meas- 
ure of threshold. 

Two rather widely employed examples of the choice-reac- 
tion method are the "jumping apparatus" (Lashley, 1930) 




FIG. 39. An apparatus for teaching a visual discrimination. The 
animal starts at S; d is the choice point; g and #' are electric grids; a and 
a' are doors to the food alleys / and /'; /. is the light box; the stimuli to 
be discriminated are the differently illuminated round windows. (Re- 
printed from Crafts, Schneirla, Robinson, and Gilbert, Recent expert- 
merits in psychology, as redrawn from Lashley, 1929. Copyright, 1938. 
Courtesy of McGraw-Hill Book Company.) 

and the "T-box" (Yerkes and Watson, 1911). In the first, a 
hungry rat is taught to jump from a ledge through either of 
two identical loose curtains, behind which it lands on a plat- 
form and is reinforced with food. After conditioning, the 
curtains are changed so that they bear different stimuli, such 
as a circle versus a square. A jump to the arbitrarily 'correct* 
stimulus carries him through the curtain to food; an 'incor- 
rect' jump dashes him against a door which is closed behind 
the curtain and drops him into a net. The stimuli may be 
equated in all respects (brightness, color, etc.) save the one 
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to be discriminated (shape), and are frequently interchanged 
in position to avoid the strengthening of response to irrel- 
evant cues or the growth of a 'position habit/ The method 
could, of course, be used without employing negative rein- 
forcement for the 'wrong' response but simply omitting posi- 
tive reinforcement. 

The T-box is essentially a bifurcating alley in which the 
animal starts at the foot of the shaft, moves down, and turns 
right or left into one of two end-boxes, one being correct in 
that it contains food, and the other incorrect in that it leads 
to no reinforcement or to negative reinforcement like electric 
shock. The correct turn is signalled to the animal by the S D 
in use, and, again, care is taken to avoid mere position habits. 
As in the case of the jumping technique, some criterion of 
successive correct choices is taken as the measure of dis- 
crimination. 

Both the T-box and the jumping technique may be em- 
ployed in studying the effect of various factors (brain opera- 
tions, drug injections, motivation, past history, etc.) upon 
discriminative capacity, but the T-box has had the greater 
usefulness. It permits the testing of more sense functions and 
a wider variety of animals. In addition, it may be noted that 
the T-box is essentially a one-choice maze (a simple T- 
maze) and is suitable for at least the exploratory investiga- 
tion of still other influences upon operant behavior. 

Discrimination and "Experimental Neurosis" 

Much attention has been paid in recent years to aberra- 
tions of behavior which appear when animals are forced to 
their discriminative limits. Pavlov (1927) observed, in the 
ellipse experiment mentioned above, that when his dog 
reached the 9:8 ratio of axes, an attempt to push this dis- 
crimination completely disrupted the dog's behavior. He 
became violent, bit the apparatus, whined, and barked; in 
the investigator's opinion, he presented "all the symptoms of 
acute neurosis/' One of Pavlov's students obtained a similar 
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effect in a pitch-threshold experiment, a visiting psychiatrist 
agreeing that the animal was "neurotic." These induced dis- 
organizations are now called "experimental neurosis," and 
have since been established in rats, cats, sheep, pigs, and even 
human beings. They point to a closer integration of psycho- 
pathology with the theories and laws of general behavior. 

Experimental neurosis may be obtained in both respond- 
ent and operant discriminative conditioning. In the latter, 
the disruption of behavior is accentuated if, in addition to 
going unreinforced, the responses to S A are punished. Forcing 
a response in the jumping apparatus by goading the animal 
with a blast of air or pushing him off the ledge, may have this 
result. When negative is combined with positive reinforce- 
ment in this fashion, the amount of "conflict" increases as 
the threshold is neared or the difficulty is increased. There is 
no choice but to respond, and a "wrong" response is costly in 
terms of punishment. 

The symptoms of experimental neurosis include refusal to 
work, excitement, cowering, disturbances of breathing and 
heartbeat, irregularities of activity and rest cycles, rigidity, 
trembling, convulsions, and spasmodic muscular twitchings 
('tics'). Oftimes, the animal will seem normal when outside 
the experimental situation, but the abnormalities erupt as 
soon as it is put back. In some instances, the disorganization 
is carried over into the animal's routine environment (Lid- 
dell, 1938), and may slowly disappear with prolonged vaca- 
tion, only to emerge with full force when the experiment is 
resumed. Once set up, the neurosis may not only affect the 
discrimination of threshold stimuli, but an even cruder dis- 
crimination (e.g., an ellipse ratio of 2:1) may be disturbed. 

Human beings cannot, for obvious social reasons, be sub- 
jected to extreme experimental neurosis. In Pavlov's labora- 
tory, however, small degrees of upset were produced in a 
child of six by requiring a discriminative motor response to 
metronome beats. Rates of 144 and 92 beats per minute were 
easily discriminated, but after narrowing down the difference 
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to 144 and 120 per minute, the child became surly, disobedi- 
ent, and uncooperative. Other children, organically impaired 
by encephalitis or cretinism, were also brought to the point 
of mild behavioral disruption by being required to make 
overly fine discriminations. Such manifestations are not un- 
like those sometimes observed when human adults are 
repeatedly compelled to attempt delicate discriminations in 
psychophysical experiments or in certain occupational train- 
ing. Students of radio code, for example, have been known to 
show deviations from normal behavior which, in extreme 
cases, warranted a psychiatric classification of "code neurosis/' 
It is still too early to judge the implications of this line of 
research for the whole range of human maladjustments. To 
many psychologists, it seems a good beginning. The precip- 
itating causes of human abnormality often appear similar in 
principle to the experimental observations. The work is be- 
ing actively pressed in many laboratories, by psychiatrists as 
well as psychologists. Phobias, compulsions, anxieties, and 
other long-recognized disorders are fast becoming the concern 
of the experimentalist as well as the therapist and the clinical 
investigator. We may look forward with some optimism to a 
day of mutual aid and understanding between workers in 
these two historically unrelated fields. 

Discriminative Reaction Time 

In our study of discrimination, we are interested in how a 
response depends upon stimulus conditions. When we come 
to analyze this dependency, several measures of response are 
possible. For example, there is response frequency, in the 
now familiar bar-pressing situation, when animals are in- 
volved. In psychophysical experiments, we use the frequency- 
of-detection of stimuli or stimulus differences. Latency of 
response affords another measure one which has historically 
been called reaction time. 

How fast can a person react to a discriminative stimulus? 
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This question is ordinarily answered in the laboratory by the 
following procedure. We instruct a subject to respond as 
quickly as he can; we give him a 'ready signal' so that he can 
get 'set'; and we deliver the stimulus. We may use a light, a 
sound, a touch, or any other stimulus we wish; and the sub- 
ject's response may be pressing a telegraph key, calling out a 
word, or any other operant that we can conveniently record. 
The time between the stimulus and the response is his "reac- 
tion time/' 

The study of reaction time was not launched by psycholo- 
gists. During the nineteenth century, before the advent of 
modern registering devices, astronomers were vitally con- 
cerned with this matter. In attempting to note the exact 
moment at which a star passed the meridian, they had to 
observe its movement and respond immediately when it 
crossed a hairline in the objective of a telescope. Over the 
space of many years, they struggled with the problem of cut- 
ting down discrepancies in the observations of different per- 
sons; and they bequeathed to us the expression, "personal 
equation," for individual differences in reaction time among 
various observers. Also, H. L. F von Helmholtz, a brilliant 
investigator in physics, physiology, and other fields, at- 
tempted, in 1850, to determine the speed of nerve-impulse 
conduction in human beings, with a reaction-time method 
modeled after one he had employed on a nerve-muscle prep- 
aration excised from a frog's leg. A subject's skin was stimu- 
lated at points differently distant from the brain, and to each 
stimulus he was instructed to respond as quickly as he could. 
By dividing the difference in estimated length of nerve for 
two skin spots by the difference in the two reaction times, 
Helmholtz arrived at an approximate speed of conduction. His 
final figure of about sixty meters per second was surprisingly 
close to currently accepted speeds (about sixty-eight meters 
per second), although his data were so variable that he would 
not trust them. 
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It was not until 1868, however, that Bonders, another 
physiologist, pointed out that several types of reaction time 
might be investigated. He saw that increasingly complex 
stimulus and response alternatives should have their effect 
upon reaction time, and he set down what he thought were 
the three typical experiments. 

1. The A-reaction. The subject makes only one specified 
response, say pressing a key, and a single stimulus is used each 
time. This is the so-called simple reaction time. It has been 
used as an index of the effect of drugs and fatigue; in com- 
paring response speeds with stimuli belonging to the dif- 
ferent senses; in determining the importance of stimulus 
intensity; and under other experimental conditions. 

2. The B-reaction. Two stimuli are employed, each with 
its own appropriate response. For example, a subject may be 
asked to respond with his right hand when shown the color 
green, and with his left when shown red. On successive trials, 
the stimuli are interchanged at random. The subject, on each 
occasion, discriminates which is being presented, reacting 
with the appropriate hand. This is sometimes called the "dis- 
junctive" reaction time, because of its either-or character- 
that is, the subject responds in one way or the other. 

3. The C-reaction. This is another disjunctive reaction 
time, sometimes known as the "discrimination" reaction 
time. Two stimuli are used, and the subject must discrim- 
inate by responding to one, but not to the other. This is 
essentially the discriminative situation that we described 
earlier in connection with bar-pressing and Elder's (1934) 
study of auditory acuity in the chimpanzee. 

If you think this over, you will probably conclude that the 
order of increasing complexity in Bonders' types is not A-B- 
C, but A-C-B. The C-reaction adds a discriminative factor to 
the A-reaction; and the B-reaction adds a response factor to 
the C-reaction. The average reaction-time values for the three 
cases come out as you might expect: A is fastest, C is inter- 
mediate, and B is slowest. 
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Other investigators went further. It seemed reasonable that 
two things should hold. In modern terminology, we would 
say that (i) reaction time might be used as a measure of 
degree of stimulus generalization; and (2) reaction time 
should increase with the number of different responses 
brought into play. The correctness of both assumptions has 
been well established. In one series of studies, Henmon 
(1906) studied the B-reaction, using pairs of colors, tones, 
and lines. Members of the pairs were different in varying 
degree. One pair of lines, for example, might be nearly the 
same in length, whereas another might be quite different. 
The subject was instructed to respond to a specified color or 
line of the pair with the right hand if the stimulus was on 
the right, and with the left hand if the stimulus was on the 
left. With the tones, a right-hand reaction was called for if 
the second of the two tones was higher in pitch, and a left- 
hand reaction if lower. The kind of results obtained are 
typified by those shown below. 

Stimuli to Be Average Reaction 

Discriminated Time (milliseconds) 

Tones differing by 16 cycles 290 

"12 " 299 

" 8 " 311 

" 4 " 334 

The importance of increasing the number of responses 
(better, stimulus-response units) has been demonstrated in 
several ways. One experimenter (Merkel, 1885) used a sepa- 
rate stimulus and reaction key for each finger. The stimuli 
were five Arabic and five Roman numerals, visually pre- 
sented, and each was associated with the reaction of a specific 
finger. In the simplest situation, only one stimulus and only 
one finger movement was employed; in the most complex 
situation any of ihe ten stimuli might appear and the appro- 
priate reaction had to be made. The average reaction times 
in the ten different cases were as follows: 
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Stimulus-Response 


Average Reaction 


Units 


Time (milliseconds) 


i (simple reaction) 


187 


2 




316 


3 




3 6 4 


4 




434 


5 


(disjunctive 


487 


6 


- reactions) 


532 


7 




57 


8 




603 


9 




619 


10 




622 



Merkel's experiment, and possibly Henmon's, have a bear- 
ing upon another problem, that of response generalization 
("induction"), to be discussed in the following chapter. The 
evidence is clear, however, that discriminative difficulty, aris- 
ing from stimulus generalization, is reflected in reaction time. 
The greater the generalization, the slower the reaction time. 

Reaction Time and Operant Latency 

Every reaction time is an operant latency. More specifi- 
cally, it is the minimal latency of an operant response to a 
discriminative stimulus. As in the case of threshold deter- 
minations, the measurement of minimal human latencies is 
enormously facilitated by previous conditioning to verbal 
discriminative stimuli. The subject is instructed, he under- 
stands, and he cooperates by doing his best. He has long been 
reinforced for rapid response when such requests or their 
equivalents form a part of the discriminative-stimulus com- 
pound. It is even possible that, finding himself in a labora- 
tory, face to face with an experimenter, and so on, he may 
not even require "instructions" to respond with all speed. 
He "sets himself," we say, for the task; he "assumes" that it 
is the quick response that is wanted. 

The animal usually behaves otherwise. Reduced latencies 
are ordinarily obtained only after considerable training. But 
the basic procedure is the same. The reinforcement of a re- 
sponse is made contingent upon the occurrence of that re- 
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sponse within a given time after S D is presented. We start 
with the animal's normal range of latencies and get him to 
speed up by withholding reinforcement for the slowest 
responses. By gradually eliminating the longer latencies 
through extinction, while continuing to reinforce the shorter 
ones, we ultimately reach a latency that is minimalbeyond 
which selective reinforcement is no longer effective. We have 
then determined his reaction time for this S D . The process 
may have required several hours, distributed over several 
days, whereas a human reaction time could have been de- 
termined in a single experimental session, but the outcome 
is the same; in each case we have achieved a minimal latency 
through selective reinforcement. 

Latency and the Discriminative Process 

The normal operant latency of response to an S D in the 
bar-pressing situation may fluctuate considerably from one 
stimulus presentation to another, depending at least in part 
upon the animal's location and ongoing activity at the time 
of stimulus onset. One might, however, expect that, in form- 
ing a discrimination, there would be a change in latency of 
response to S D and S A , the former decreasing, the latter in- 
creasing, as training progresses. If this were so, we would 
have another way of analyzing the process besides that pro- 
vided by the cumulative response curve. 

Unfortunately, the findings on this score have not been 
very helpful. While a change in latencies does occur in the 
bar-pressing experiment, it is quite small, it comes in very 
early, and it stops long before the discrimination is firmly 
established. In other studies, where a running operant is 
used, the situation is apparently different. In teaching a dis- 
crimination with the runway apparatus, the end-box contains 
a reinforcement only if an S D (e.g., light over the runway) is 
present, and S A (no light) runs are not reinforced. On succes- 
sive S D trials, the animal starts out more and more quickly 
after the release door is opened; whereas interspersed S A trials 
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give progressively slower starting times, increased vacillation, 
and more instances in which the animal does not leave the 
starting box at all. The technique is a trial-by-trial affair, 
like that of the T-box and the jumping apparatus, with no 
free operant to be observed in process of change; but, if we 
treat starting times as operant latencies, we do observe a 
trend of times which is somehow related to the formation of 
the discrimination. The problem of bringing a latency meas- 
ure into closer harmony with a rate measure is yet to be set- 
tled. 

Workaday Reaction Times 

In everyday behavior, most of our responses are not 
emitted under such favorable conditions as those of the lab- 
oratory; and the demand for high-speed reactions is made 
only occasionally, as in athletics, military combat, and the 
control of such machines as the airplane and the automobile. 
For several reasons, the values reached in human reaction- 
time experiments are seldom approximated, even in these 
pursuits. Warning signals are often lacking; one may not 
know from what direction the stimulus is coming; one is usu- 
ally engaged in doing something else when the response is 
suddenly demanded; the response may involve the action of 
large, rather than small, muscle groups; the stimulus may be 
very weak, or so strong as to cause "freezing*'; and so on. 
Thus, a group of football players in uniform may average as 
much as 400 milliseconds in getting off a scrimmage line at 
an auditory signal (Miles, 1931); and the reaction time of 
automobile drivers may rise to several seconds when an im- 
pending accident requires a shift of the foot from gas pedal 
to brake. 

Individual differences in reaction time have long been rec- 
ognized, and attempts have often been made to use these dif- 
ferences as a basis for selecting men to be trained in special 
skills. If one could predict the success of radio operators or 
fighter-plane pilots from their performance on a short speed- 
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of-response test, then one would pick only the fastest men for 
schooling and thereby save the effort and expense of training 
those men who would ultimately "wash out" or become com- 
bat casualties. When used for such purposes, however, re- 
action-time tests have proved of indifferent value, probably 
because other factors (discriminative capacity, motivation, 
adequate training procedures, etc.) are also important for 
success in these occupations. 

Multiple Discriminations 

Although an analysis of discrimination necessarily begins 
with simple cases, our behavior is usually guided by stimulus 
combinations that are quite complex. Many S D 's may some- 
times operate in succession and we may be unable to respond 
to all of them properly. At other times, they may operate 
simultaneously and our response is to the compound rather 
than any single element. Again, a response may be made to 
one or a group of S D 's which are constantly changing. In any 
or all of these cases, the basic processes of discrimination re- 
main as we have outlined them, but their working out often 
presents difficult problems. 

i. Successive discrete S D 's. We can illustrate the first type 
of discriminative complexity by what is encountered when 
one tries learning to receive International Morse Code. As 
everyone knows, this code consists of short and long sounds 
("dots" and "dashes") which are combined in various ways 
to form distinctive patterns. A beginner needs to learn thirty- 
six of these patterns or signals twenty-six for the alphabet, 
plus ten for the numbers 0-9. His job is to become able to 
discriminate each well enough so that he can write down, or 
"copy," the appropriate letters or numbers as the signals are 
sounded one at a time and in random order. At first, many 
of the signals sound alike, and the student puts down the 
wrong character. These are obviously cases of generalization. 
A table of errors made for a class of beginners in code will 
show that these generalizations fall within certain categories 
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depending upon the dot-dash composition of the signals 
(Spragg, 1943; Keller and Schoenfeld, 1944). The problem 
for the learner is very largely a discriminative one, since his 
responses the writing or printing of letters and numbers- 
are usually well established ("differentiated") long before he 
hears code. 

An example of the sort of generalization that occurs in 
code mastery has already been given earlier in this chapter 
when we considered the concept of "similarity." In Table V 
below are given several other instances of the kind of con- 
fusion that makes trouble for beginners. These are typical, 
but not the only ones that occur. Most of the signals actually 
generalize with several others. Thus, the signal for P may 
generalize not only with the one for J, but also with the ones 
for F, L, C, (..), Q, X, and perhaps a half dozen more 
in the early stages of training. Moreover, this generalization 
is reduced by training less readily for some signals than for 
others. Long after W, P, and F are discriminated, 5 and 6 
may give trouble by generalizing with H and B respectively, 
4 (....) will continue to generalize with V (. . . ), and 
so on. 

Table V 

GENERALIZATIONS COMMONLY MADE BY BEGINNERS IN 
LEARNING TO RECEIVE INTERNATIONAL MORSE CODE 

Signal is generalized with Signal 
(W) 

- (J) 
(3) 



(F) 
(*) 
(5) 



(H) 



(Q) (Z) 

(X) - (B) 

(6) - (B) 

As with all discriminations, the speed of learning code can 
be increased by making reinforcement as immediate as pos- 
sible for the correct discriminative response. (For more on 
delayed reinforcement, see pp. 208-209). This fact is utilized 
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in the ' 'code- voice" method of teaching now employed by the 
U.S. Army Signal Corps (TM 11-459). After each signal is 
sounded, there is a pause of two or three seconds during 
which the student writes down, if he can, the character that 
was represented. Then the character is identified vocally by 
the instructor and the next signal is presented. Eventually, 
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FIG. 40. Progress curve for a group of beginning students learning 
to receive International Morse Code by the "code-voice" method. The 
data plotted are the average per cent correct responses on successive 
training "runs" of 100 randomized signals each. Prior to the first run, 
the 36 signals were identified once, and this is one reason the curve 
does not start at zero per cent correct. (After Keller, Christo, and 
Schoenfeld, 1946.) 

under this form of tuition, the student accurately anticipates 
each announcement, and the use of the reinforcing "voice" 
may be discontinued. Figure 40 shows the sort of progress 
that may be expected of college students when given one hour 
daily of this kind of training. 

2. Compound S D 's. This category of multiple discrimina* 
tions includes some cases which seem so natural to us that 
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we do not often think to inquire about them, and other cases 
which are the basis of some amusing and unusual effects. Let 
us consider, as our first example, that of depth perception. 
We accept unquestioningly the fact that we can see objects 
in depth or, better, that we react appropriately to the third- 
dimensional aspect of objects. Under ordinary circumstances, 
we reach for common objects with great accuracy and assur- 
ance, and we judge, with considerable success, the distance of 
those beyond our grasp. We pick up the pencil or book with 
a sure reach; we unhesitatingly aver that the house is nearer 
to us, or farther away, than the barn; and we even estimate, in 
feet or yards, the distance from 'here* to 'there/ Moreover, 
objects themselves appear solid. All this despite the well- 
known fact that images of objects upon the retina of the eye 
are, as in a photograph, in only two dimensions. 

The history of research tells us that many cues contribute 
to the adequacy of these adjustments, and helps us to ap- 
preciate the extreme complexity of the S D 's we employ. For 
convenience, we may summarize these cues under two head- 
ings: those which are effective even when only one eye is 
used, and those which depend upon the operation of both 
eyes simultaneously. 

1. Monocular cues include such S D 's as arise from (a) the 
interposition of objects (the nearer object hides, in part, the 
object that is farther away); (b) size and perspective (the far- 
away object is smaller than the one near-by, and the continu- 
ous change in object size with distance is perspective); and 

(c) the distribution of light and shadow (concavity and con- 
vexity, which are third-dimensional characteristics of objects, 
commonly depend upon this cue). These and one or two 
others have long been recognized by painters who sought to 
represent depth upon canvas. 

2. An important binocular cue is that provided by the 
fact that our two eyes, being apart, cannot be stimulated in 
exactly the same way by a single solid object. The depth pro- 
vided by this disparity may be demonstrated dramatically 
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with a stereoscope. This device, a fixture in the old-fashioned 
parlor, permits two slightly different bi-dimensional views to 
stimulate the two eyes separately, with a resultant tri-dimen- 
sional effect that may be very striking. 'Stereoscopic vision* 
provides us with one of our most subtle cues. 

Not all of these cues (and we have omitted several others) 
need to operate at one time to guide our adaptive reactions, 
although many of them are commonly available. Numerous 
experiments have shown that one or two may function, often 
very effectively, in the absence of the rest. The loss of an eye, 
which automatically eliminates all binocular cues, does not 
thereby render the loser helpless; it merely makes him more 
dependent upon the cues that remain. 

Some of the depth S D 's are apparently learned in the early 
movements and experiences of the child. As he reaches and 
crawls, for example, he soon learns the importance of inter- 
position, perspective, and the movement cues provided by 
the muscles of the eyes. Some are so obscure that, even with 
coaching, they are not easily appreciated. Yet the effective- 
ness of our movements in space is clearly dependent upon 
them, and these movements are continually being reinforced 
by their outcome. After awhile, we think nothing of reaching 
for objects and finding that our hand has gone just the right 
distance and in just the right direction to make the contact; 
or we throw a ball with considerable accuracy without once 
stopping to attend to the discriminative basis of the act. 

Another instance of the operation of compound S D 's is seen 
in the fact that a given object, under changing stimulus con- 
ditions, seems to retain its proper character. A man stand- 
ing on the sidewalk across the street seems as tall as a man 
should be, although his image on the retina of your eye is 
much smaller than it would be if he were standing close by. 
This phenomenon is referred to as "size constancy*'; but 
other object properties than size lead you to behave in a 'con- 
stant' way. In a similar fashion, you can judge with great 
accuracy the shape of an object despite the changing stimula- 
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tion it supplies as it moves about in your environment. You 
have no difficulty in seeing a dish as round, although its pro- 
jection on your retina is usually elliptical. The brightness of 
objects provides still another example. Your handkerchief 
looks white whether you see it in the sunlight, in the class- 
room, or in a dim corridor. The important point about all 
forms of constancy is simply that, in responding to the physical 
properties of objects, your responses are the product of the 
discriminative stimuli arising not only from that object but 
also from as much of the environment as is available. If you 
were asked to judge the size of a line of light in a completely 
dark room, you would find the task difficult you would ask 
to know, or you would try to guess, its distance from you be- 
fore you ventured an estimate. Under strict experimental 
control, the dish can be made to look like an ellipse, by cut- 
ting off all cues of its tilt with respect to other environmental 
objects and with respect to your own person. And brightness 
constancy can be destroyed by a similar reduction in the 
number of cues (such as prevailing illumination of the en- 
vironment) upon which you ordinarily depend, so that the 
handkerchief will look black under special conditions. 

"Redintegration" is another case in point. It is easy to 
show that when S D compounds are the basis of a response it 
is possible to obtain that response to a portion of the com- 
pound. The recognition of a work of pictorial art may be 
based on only one detail, or the identification of a symphony 
may be made with only a few notes carrying a simple melodic 
line. Redintegration is a case of generalization through par- 
tial identity. It is not, as was once thought, a separate prin- 
ciple of discrimination, but is to be thought of in terms of 
generalization and elements of compounded S D 's. 

3. Changing S D 's. Our third category of discriminative 
complexity is that in which S D 's are in a continuous state of 
change, and a few examples will suffice to show its prevalence. 
To a tennis player, a bounding ball gives an infinite and 
continuous series of stimuli, but his response to them must 
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be sure and quick. To the sportsman who brings down a 
duck or shatters a clay pigeon, or the skilled machinist wha 
follows the movement of the cutting tool on his lathe, the 
problem of discrimination is again one of change. For a 
fighter pilot or gunner, the same is true. A device called a 
pursuit-meter has been employed in selection tests for gun- 
ners and pilots. This apparatus tests the accuracy with which 
a candidate can follow a moving target through an erratic 
course. In general, men who do not perform well with such 
an apparatus are likely to fail in flight or gunnery school, 
hence may be eliminated in advance in order to save expense 
and disappointment. 

"Higher Units" in Perception 

In the laboratory, we seek to isolate the S D 's which enter 
into complex discriminations like the above, although we 
know that in ordinary experience they are not so isolated. 
But we would also like to know how they become integrated. 
In learning Morse code, for example, the student progresses 
to a point where he is no longer responding to discrete sig- 
nals, but is hearing a number of them together. An indica- 
tion of this is the fact that, with plain-language transmissions, 
he can "copy behind" with considerable ease, the extent of 
his lag being an indication of how much he can grasp at one 
time that is, how big his unit is. A burst of signals sounds 
to him like a word and he does not pause to break it up into 
separate letters. In early studies of code learning (Bryan and 
Harter, 1899), it was observed that before a student could 
pass beyond a certain word-per-minute proficiency he might 
spend a good deal of time during which he made no apparent 
progress. If he persisted, he could resume his advance, this 
time going on to a new level of mastery in which he could 
handle groups of signals or short words. Periods of no prog- 
ress were called "plateaus" and it was at first thought that 
these were necessary stages in which the learner consolidated 
smaller units or elements into higher units. Today we believe 
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that such intervals of no progress can be largely avoided by 
carefully and systematically combining signals and giving 
him practice on higher units, rather than allowing the stu- 
dent to form them himself on the basis of accidental com- 
bination and by dint of mere perseverance. Higher percep- 
tual units in vision can also be studied in connection with 
the so-called span of apprehension. Flash for one-tenth of a 
second a number of letters on a screen before a student who 
has been instructed to call them back. If he is tried on one, 
two, three, or more, we can quickly determine the largest 
number that he will apprehend without error, and how his 
accuracy diminishes when we exceed this number. Suppose 
we find that a given subject can unfailingly report correctly 
as many as six letters. If we change over and flash short words 
on the screen, we find that he can report about as many words 
as he did letters. If these words were three-letter words, then 
he is reporting eighteen letters whereas he could report only 
six before. Obviously, in the case of the words, the letters are 
no longer functioning as separate elements, but this time the 
words are elements. We have here another instance of the 
combination, through training, of simple S D 's into com- 
pounds or higher units. 

Concept Formation 

What is a ''concept'? This is another term which has come 
into psychology from popular speech, carrying with it many 
different connotations. We shall have to be careful in using 
it, remembering that it is only a name for a kind of behavior. 
Strictly speaking, one does not have a concept, just as one 
does not have extinctionrather, one demonstrates concep- 
tual behavior, by acting in a certain way. Our analysis should 
really start with a different question: What type of behavior 
is it that we call "conceptual"? And the answer is that when 
a group of objects get the same response, when they form a 
class the members of which are reacted to similarly, we speak 
of a concept. A child's concept of "horse" may be such that 



GENERALIZATION AND DISCRIMINATION 155 

his first sight of a cow, a mule, or a camel may all result in 
his saying "Giddap" or "Whoa," or simply "Horsie." A group 
of events may also be responded to in the same way and thus 
form a concept, such as "war." Classes of objects or events, 
differently responded to, develop different concepts. "But," 
you may say, "this is only generalization and discrimination 
all over again" and so it is. Generalization within classes and 
discrimination between classes this is the essence of con- 
cepts. 

i. The growth of concepts. It is important that we main- 
tain an objective attitude toward concepts, that we see them 
in terms of behavior. A good way to do this is to note how 
some concepts develop in human children. At birth, the 
world of the child may be, as William James said, no more 
than a "booming, buzzing confusion," but very soon he re- 
sponds in different ways to different parts of his environment. 
For example, at the age of about three months, he seems to 
show the rudiments of a 'social smile' that is, he smiles at 
other human beings. We like to think, at this stage, that he 
'knows us/ and parents fondly believe that they alone can 
evoke the expression. A test will quickly show, however, that 
the child is responding to the moving or noise-making aspects 
of parental behavior; moving animals and rattling toys will 
also set off the reaction. We may, if we wish, think of this as 
a primitive conceptual distinction between moving and non- 
moving objects. Later on, the child may reserve his smile for 
members of the family, even crying at the approach of stran- 
gers, but many months of learning must precede the advance 
of his discrimination to this stage. 

An interesting set of studies by Piaget (1929) traced the 
changes in the child's concept of living versus non-living ob- 
jects. This was done by asking children of various ages what 
they thought was alive and why, whether objects feel hurt 
when kicked or become sad when locked in a closet, and so 
forth. It turned out, as you would expect, that the criteria 
used for defining living and non-living objects change with 
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age and the accumulation of experience. At one time all mov- 
ing objects are called alive. Later on, a distinction is made 
between things that move when pushed and things that move 
'by themselves* (brooks, clouds, trees in the wind). Only grad- 
ually, and never completely, do they approach in their con- 
ceptual behavior the 'realism* so highly prized by modern 
man. Since our own movements of locomotion, respiration, 
speech, and the like are perhaps the first, and certainly the 
most important, to occasion the response "living," it is not 
strange that we find the brook 'running/ the wind 'sighing/ 
the trees 'whispering/ and the table 'groaning* under its load 
of viands. 'Animism/ which is opposed to 'realism* and which 
is said to involve the ascription of the properties of animate 
things to those that are inanimate, is readily seen to be an- 
other case of generalization. Rigorous discriminative train- 
ing is required before a concept of living objects can be 
formed which excludes this primitive extension. 

Through education, formal and informal, our concepts are 
altered and enlarged. Where once we thought that trout, eels, 
sharks, and whales were all "fish/* we learn in school that 
these organisms fall into different categories, and our con- 
cept of "fish** is radically changed. Similarly, our concept of 
airplane once included heavier-than-air craft with propellers, 
wings, engines, fuselage, and tail. Today we are modifying 
this category to include pilotless planes that are jet-propelled 
and without the conventional engine. Apparently the pres- 
ence of wings is the indispensable part of our concept of air- 
plane, since wingless ships, like rockets, do not evoke the 
name. In these cases, and in others, the difficulty of classifying 
often becomes acute. For example, we know what "fighting** 
is, but we may be unable to say whether puppies, engaged in 
rough-and-tumble, are "fighting** or "playing.** 

2. Experiments on concept formation. In the laboratory, 
studies of concept formation fall roughly within two major 
classifications. As an example of the first, we have an experi- 
ment by Hull (1920) in which subjects were taught to re- 
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spond with nonsense words (li, ta, yer, etc.) to different 
Chinese characters. Six lists, of twelve characters each, were 
memorized successively by each subject. The seventy-two 
characters were different in total composition, but each of 
the twelve characters of the first list had a component (a 'radi- 
cal') which was present in one of the characters of each later 
list, and the same response word was required whenever this 
radical appeared. Thus, the word li was applied to six dif- 
ferent characters (one in each list), all of which possessed the 
same radical. The point of the experiment was to see whether 
the learning of successive lists would be facilitated as the sub- 
ject was given more and more experience with the basic radi- 
cals. The results showed that the process of mastery was 
indeed speeded up as each new list was learned. The subjects 
came to 'form concepts' in the sense that they generalized on 
the basis of the key radicals in the different characters. More- 
over, some subjects who were able with very few errors to 
respond to the characters of the sixth list were unable, when 
asked, to identify the radicals to which they were correctly 
responding. Evidently, this kind of conceptual behavior may 
be established without a subject's being able to state in words 
just what he is doing. This is most informative when we 
consider that in common speech we normally tend to identify 
the 'possession' of a concept with one's ability to verbalize it, 
even if only approximately. 

In the second major type of experiment on concept forma- 
tion, a subject may be shown a number of objects and in- 
structed to find the common characteristic which establishes 
a category. Thus, Smoke (1932) studied the formation of ten 
concepts each of which concerned a certain type of geometri- 
cal design having a nonsense name. A "pog," for example, 
always contained a circle within a rectangle, although a 
series of "pogs" might differ in the size or some other aspect 
of the circle or rectangle. The subject examined one "pog" 
after another when they were presented and advised the ex- 
perimenter when he was ready to define the class. He was 



158 PRINCIPLES OF PSYCHOLOGY 

then asked to state what a "pog" was, to draw two examples, 
and to select from a list of sixteen test figures those which 
fulfilled the "pog" requirements. Although the ease of the 
generalizations depended upon the kind of geometrical de- 
sign employed, the subjects were usually able to single out 
those containing the same basic components. Moreover, in 
agreement with Hull's findings, they were sometimes unable 
to define a class satisfactorily when they had already 'passed* 
the other tests. 

3. Non-verbalized concepts. In view of the fact that a sub- 
ject may behave conceptually without being able to tell us 
the basis of his discrimination or generalization, it is natural 
to ask whether concepts are peculiarly human. From observa- 
tion and theoretical analysis, we are led tentatively to con- 
clude that lower animals exhibit fundamentally the same 
behavior. Consider a hunting dog, being trained to pursue 
rabbits. Early in his training he may fail to discriminate ade- 
quately and find himself embarrassed in an encounter with a 
polecat or a hedgehog. When we say, later, that he is properly 
trained, we mean, among other things, that his concept of 
rabbit will exclude these other creatures. In the laboratory, 
we may train animals, as well as young children, in concept 
formation, when there is no possibility of verbal definition. 
A child, for example, may be taught to reach toward a trian- 
gular figure for a piece of candy, while reaching to a circle 
goes unrewarded. Later, if we substitute different shapes of 
triangles and different sizes of circles, his response remains 
correct. Animals may also be taught to generalize on the basis 
of triangularity (Fields, 1932) . We approached this matter 
before, in our treatment of "similarity" and stimulus equiva- 
lence, and now we can see that equivalent stimuli is what we 
mean when we speak of a concept. 

The ''transposition" experiment provides us with another 
instance of this sort (Spence, 1936; Jackson, 1939). A child 
or an animal is trained to approach the brighter of two discs. 
After thorough conditioning, the brighter disc is removed 
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and in its place another, dimmer, one is added so that the 
same ratio of brightness exists between them as previously 
did between the earlier pair. The organism will respond 
to the disc which is now the brighter one, although it was 
formerly the dimmer. This experiment has been spoken of 
as an example of the concept "brighter than/' but the essen- 
tial fact is not altered. We are here dealing with the generali- 
zation of stimulus patterns. 

It is curious to note the resistance that may be shown to 
the notion that the term concept need not be limited to 
matters capable of being verbalized or found only in the be- 
havior of human adults. We seem to have here a problem in 
our own behavior. We have formed a concept of conceptual 
behavior which is based upon such factors as the age of the 
subject, his ability to verbalize, and the fact that he is human. 
It is true that our verbal behavior as adults becomes very 
complex and that, in philosophy, for example, we may wres- 
tle with the identifying characteristics of such verbal concepts 
as justice, virtue, and wisdom. In such highly sophisticated 
discourses, we have an example of the attempt to reach agree- 
ment as to the criteria of the just, the virtuous, and the wise 
to enumerate the essential S D 's for evoking these words. 
Disagreements arise through differences in individual dis- 
criminative histories. The problem is not unlike that faced 
by writers on semantics; if we wish to know whether two per- 
sons 'have in mind' the same thing when they use the same 
word, the answer will lie in the overlap of discriminative 
stimuli which evoke the word from each. The fact that two 
persons using the same word may agree on some of its mean- 
ings and disagree on others should not be surprising they 
have, after all, not had identical training, and the conditions 
which evoke the word in one may disagree in part with those 
which evoke it in the other. 

We have been dealing with concept formation as a resul- 
tant of stimulus generalization and discrimination. This is 
all right but, especially in the case of adult human beings, 
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it is not the whole story. An important sector of adult be- 
havior is verbal in nature, and the analysis of this behavior 
must precede a fuller understanding of conceptual activity. 
In the concluding chapter of this book, a brief sketch will be 
offered of a promising theory of verbal behavior that is in 
accord with the general principles you have learned. After 
you have read that sketch, you might profitably turn back to 
review what is said here. 

4. Concepts and mediated generalization. Generalizations 
are said to be mediated when they are based upon a stimulus 
equivalence which results from training (Gofer and Foley, 
1942). Some concept formation is an example of such equiva- 
lence. The words vase and urn have few stimulus properties 
in common. If we were to condition a respondent or an 
operant in a young child to the sound of one of these words, 
there would be very little generalization to the sound of the 
other. An adult, however, who has learned that these words 
are almost synonymous and may be used interchangeably, is 
likely to show considerable generalization. Following the 
lead of Razran (igsgb), who first explored this area, Riess 
(1940) conditioned the galvanic skin reflex initially to one 
stimulus word and then tested for generalization to two other 
words, one of which was a synonym and one of which was a 
homonym of the first. For the stimulus words themselves (style, 
freeze, surf, and urn\ there was an average gain of 346 per 
cent in magnitude of skin response through conditioning. 
The synonyms (fashion, chill, wave, and vase) gained 141 per 
cent through generalization; and the homonyms (stile, frieze, 
serf, and earn) gained 94.5 per cent. Whereas the generaliza- 
tion to the homonym illustrates simply stimulus generaliza- 
tion, that to the synonym illustrates mediated generalization 
based upon the previous training which produced the 'mean- 
ing* equivalence of these two words. (We shall have more to 
say about 'meaning' in Chapter 7.) Riess (1946) was also able 
to show that, for children below twelve years of age, there 
was more homonym than synonym generalizationa finding 



GENERALIZATION AND DISCRIMINATION 

that accords well with the generally accepted belief that the 
meaning value of words increases with age. 

Table VI 

MEDIATED GENERALIZATION IN THE CONDITIONING OF THE GALVANIC SKIN 

RESPONSE TO VERBAL STIMULI IN SUBJECTS OF VARYING AGE 

(Data from Riess, 1946) 





Mean Age 


7 yrs. 9 mos. 


10 yrs. 8 mos. 


14 yrs. o mos. 


18 yrs. 6 mos. 


Number of 
Children 


21 


22 


25 


24 


Word 
Homonym . . . 
Antonym . . . 
Synonym . . 


222 

159 
139 
129 


227 
65 
97 
60 


243 
60 

77 

110 


281 

5* 
103 

*49 



In the experiment, the GSR (measured in microamperes) was first 
conditioned to a word; after which generalization was tested to a 
homonym, an antonym, and a synonym of that word. Exposure of the 
stimulus words was visual, and of three-second duration. The table is 
to be read as follows: The 21 children aged seven years and nine months 
(average) showed an increase in GSR to the word after conditioning of 
222 per cent of their pre-conditioning GSR to that word; the degree of 
generalization between the word and its homonym is shown by a GSR 
to the homonym of 159 per cent of the homonym's pre-conditioning 
GSR value; between the word and its antonym, 139 per cent; between 
the word and its synonym, 1 29 per cent. At this age, stimulus (auditory) 
similarity between the word and homonym outweighs the 'meaning' 
similarity: but, by the age of 14, the order of generalization is reversed, 
with mediated generalization through 'meaning' being the largest. 

Generalization and Discrimination in Education 
The process of education is greatly concerned with gen- 
eralization and discrimination. It is possible that, at the be- 
ginning of life, all stimuli may generalize to produce mass 
and profuse responses in the infant. As he matures and learns 
to discriminate objects in his environment, those generaliza- 
tions which persevere during his pre-school years will prob- 
ably be adequate for most of his gross adjustments, but they 
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must be broken down later in the interests of his educational 
progress. The new generalizations and discriminations will 
in turn undergo change as his schooling continues. From his 
science teachers, for example, he will learn that falling stones 
and falling paper obey the same law, that fish breathe in a 
way very similar to ours, that an alley cat and a lion have much 
in common. In fact, from one point of view, the whole busi- 
ness of science will be seen as the arrangement of nature's 
facts into new categories, with a stress upon the important 
but not obvious similarities and a disregard for the obvious 
but unimportant dissimilarities. Even changes in the funda- 
mental theories of science are of this nature. The history of 
great discoveries is one of the reorganization of facts into 
new classifications on the basis of related properties. Plan- 
etary movements are tied into one general law with the 
motion of falling bodies. Electric currents are tied to the 
behavior of ions in chemical solutions. Nerve impulses are 
shown to be electrical phenomena. The behavior of all or- 
ganisms follows the same basic laws. On simple and complex 
levels of animal and human behavior, the operation of gen- 
eralization and discrimination are among the most important 
phenomena with which we deal. The Army dog that growls 
at men in strange uniform but greets its own soldiers; the 
child who learns to tell "Daddy" from "Mommy" (and may 
then call all men "Daddy"); the student who learns that bats 
and whales are both mammals, not bird and fish; and a psy- 
chologist who cites three apparently different instances of be- 
havior as examples of the same basic laws all of these are 
doing very much the same thing. 

NOTES 

A very readable source on respondent generalization and discrimina- 
tion may be found in Pavlov's (1927) own book, especially in Chapters 
VII and VIII. The statement of the principles as we have given them 
(tor both operants and respondents) have been drawn, however, from 
Skinner (1938). 

If you are interested in the technical details of the psychophysical 
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methods, Woodworth (1938) offers a thorough account. A comparison of 
human sensory capacities with those of animals at various phylogenetic 
levels has traditionally been dealt with by the comparative psychologist. 
For numerous illustrations in this area, you may consult Warden, Jen- 
kins, and Warner (1935-1940) . 

The early history of the "personal equation" and the study of reaction 
times has some entertaining features, well brought out by Boring (1929) 
and Woodworth (1938). 

Among the first to recognize the importance of a systematic attack 
upon the problem of "mediated generalization" were Gofer and Foley, 
in 1942 and later. The problem has not yet been investigated, however, 
to the degree that its importance would seem to justify. 



RESPONSE VARIABILITY AND 
DIFFERENTIA TION 




After E. R. Guthrie and G. P. Horton, Cats in a Puzzle Box, 1946 



Introduction 

In this chapter, we shall deal exclusively with operant be- 
havior. We shall concentrate upon the behavior itself, rather 
than the stimuli that set the occasion for its emission. Indeed, 
we shall deal with changes in behavior that take place ir- 
respective of the S D situation. Our major concern will be 
with three related matters. First, we shall consider the fact, 
perhaps obvious, that operant responses may differ on suc- 
cessive emissions, even when positive reinforcement is regu- 
larly applied. Secondly, we shall try to relate this variability 
to the influence of negative reinforcement that results di- 
rectly from the emission of the operants themselves an in- 
fluence that may accompany the positive reinforcement. 
Finally, it will be shown that different strengths may be 
given to different variations of a response by applying posi- 
tive reinforcement in a selective manner. In treating these 
matters, our task will not be simple, because the data are 
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incomplete and still a source of conflicting opinion; but the 
problems are important and cannot be by-passed, even in a 
text for beginners. 

Variability versus Stereotypy 

Suppose that, in the situation where a white rat obtains 
food by pressing a bar, the first response to be reinforced is 
that of bar-biting, rather than a leaning, pressing, or climb- 
ing response. We would expect, would we not, that this re- 
sponse would be strengthened: that it would be more likely 
to appear at a later time that it would gain a definite ad- 
vantage over other possible modes of bar depression? We 
would expect, too, if it did recur and was again reinforced, 
that its probability of further recurrence would increase even 
more. Eventually, we would look for a considerable automat- 
ization or stereotyping of the rat's bar-pressing behavior. We 
would expect a single, fairly restricted mode of response to 
develop. 

Although the degree of automatization actually achieved 
in this situation is not as great as the above paragraph sug- 
gests, observation indicates that, within limits, something of 
the sort actually does take place regularly in the case of the 
rat, and there are supporting data from experimental studies 
of other organisms. Guthrie and Horton (1946) observed and 
photographed approximately 800 solutions of an escape-to- 
food problem by 52 cats, obtaining camera records of all the 
"escape postures" used by thirteen of their subjects. Solving 
the problem required that the cat's responses bring about 
a slight inclination of an upright pole within an otherwise 
unfurnished response-chamber. A small deviation of the pole 
from the vertical position was enough to open a door in the 
glass front of the chamber, whereupon the animal was free to 
obtain a bit of salmon from the top of a table a few inches 
away from a point of exit. Contact of any sort with the release 
device occurred ' 'inadvertently " on the animal's first trial, 
after a period of sniffing, clawing, or pawing at the glass door, 
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and after various turns about the chamber. This was appar- 
ently a rough equivalent of the behavior described by Thorn- 
dike many years earlier note the quotation on page 38. The 
final, and effective response of the series took many accidental 
forms. One cat brushed the pole with his flank; one stepped 
on the base of the pole with his hind foot; another backed 
into the pole; and so on. After escape and feeding, the ani- 
mal was again permitted to enter the response-chamber from 
a starting-box. On this occasion, and on later ones, the cat's 




Animal # 
Trial # 



\ X 



Fie. 41. The apparatus used by Guthrie and Horton in their photo- 
graphic study of response variability in cats. The front wall of the 
"puzzle box" was made of glass to allow the taking of pictures. The 
striped pole, when pushed in any direction, would operate the door- 
opening mechanism and allow the animal to emerge from the box to 
reach a food reinforcement lying outside. The clock gave the time taken 
to get out after the animal was put in. (After Guthrie and Horton, 1946.) 



behavior often seemed to be a "detailed repetition" of that 
observed on the first trial. In not a few cases, one sample of 
release behavior of a given animal was indistinguishable 
from another in practically every aspect. Some variation in 
response was noted: minor differences in escape postures were 
fairly common and, in some cases, markedly different modes 
of escape were used by the same animal on successive trials. 
Guthrie and Horton were, however, most impressed by the 
stereotypy of the behavior, rather than the variations in it. 
(The variations were accredited by them to the influence of 
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slight changes in the stimulus situation occasioned by differ- 
ent approaches to the pole, accidental distractions, and other 
"interfering" factors.) 

Somewhat different from the Guthrie-Horton study was an 
earlier one by Muenzinger (1928), who taught thirteen 
guinea pigs to press a bar in a three-chamber problem box. 
Each animal entered the response-chamber from a starting- 
box and was given access to a food-chamber whenever he 
opened the door to it by pressing a lever. As soon as the food 
was eaten, the animal returned or was forced back to the 
starting-box where he remained until the experimenter was 
ready for the next trial. As a rule, fifteen trials were given to 
each of the pigs daily and a total of either 600 or 1000 solu- 
tions of the problem was accumulated. These solutions were 
not photographed, but Muenzinger was able to distinguish 
nine different "patterns" of successful response (three right- 
paw and three left-paw patterns, a two-paw pattern, a head- 
movement and a biting or gnawing response). He found that 
only one animal, a pig that never made the biting response, 
failed to display all nine patterns at one time or another dur- 
ing the experiment; and he noted that the animals changed 
from one pattern to another as frequently at the end of their 
training as they did in the earlier trials. Some degree of ster- 
eotypy did appear; there was a decrease in the average num- 
ber of different patterns displayed from the beginning to the 
end of the experiment, and most of the responses at the end 
fell within one or two of the nine categories. But Muenzinger 
emphasized the "plasticity" rather than the "mechanization" 
of the behavior. Even when the same pattern occurred on 
many successive occasions, he tells us that it was seldom du- 
plicated exactly from trial to trial, and might be suddenly 
supplanted by an entirely different pattern. Only three of 
the thirteen animals showed a preference for a single re- 
sponse pattern from start to finish. 

Close examination of these studies suggests that the ob- 
served behavior of Muenzinger's guinea pigs closely par- 
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alleled that of the Guthrie-Horton cats. The principal 
difference involved seems to be one of descriptive emphasis. 
In one case, the stereotypy or mechanization of response is 
stressed; in the other, the variation and plasticity. Muenzin- 
ger, in addition, seems to have been more alert to the changes 
in variability that occurred as a result of prolonged training. 
Both experiments, however, raised questions of real import- 
ance to our understanding of behavior. Some of these ques- 
tions will be considered here; others are still to be investi- 
gated. 

Response Induction 

We have seen, under the heading of generalization in the 
preceding chapter, that the reinforcement of a response in 
the presence of one stimulus will strengthen it in the pres- 
ence of other stimuli when these have properties in common 
with the first. There is, apparently, something like this on the 
side of response. The strengthening of one response may 
bring about a strengthening of certain others, where there is 
no change in the external stimulus situation. Thus, the re- 
inforcement of one act of bar-pressing will affect, to a lesser 
degree, the strength of other acts which differ from the re- 
inforced one in several identifiable aspects. 

One way in which a response may differ from another is 
in topography, by which we mean, in this context, the form 
or kind of a response. The movement of one leg, for example, 
is topographically different from the movement of another, 
or from the movement of an arm or a finger. Also, responses, 
even when topographically similar, may differ in their force 
or their duration, properties which may be subjected to quan- 
titative measurement. Thus, the bar-pressing response, even 
when quite stereotyped, will vary appreciably in its intensity 
or force, as well as in the amount of time that the bar is held 
down on each successive response. The question we now ask 
is this: will the strengthening of a response having a certain 
topography, force, or duration lead to or 'induce' a strength- 
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ening of responses which differ from it in one or more of 
these properties? 

Let us begin with topography. Kellogg (1939) conditioned 
the flexion of a dog's right hind leg to the sound of a buzzer. 
The response was originally elicited by shock and its strength 
was maintained through shock avoidance. Along with the es- 
tablishment of this foot-withdrawal, Kellogg noted that oc- 
casional movements of the other legs appeared. The number 
of such flexions was greatest for the left hind leg, next great- 
est for the right front leg, and smallest for the left front leg. 
That is, a sort of 'topographical gradient* was observed. Hil- 
gard and Marquis (1940) have treated this as an example of 
response generalization and have supplied other observations 
of a similar sort. For ease of reference, we shall speak of such 
phenomena as cases of response induction or, simply, induc- 
tion. 

An interesting maze-learning experiment, suggesting re- 
sponse induction, was carried out by MacFarlane (1930). Af- 
ter training one group of rats to swim through a maze, he 
inserted a floor beneath the surface of the water at a depth 
sufficiently shallow to force them to run the maze. For an- 
other group, the procedure was reversed. When the rats of 
each group were tested after the shift, no appreciable increase 
in errors resulted. Transfer was apparently complete, in spite 
of the fact that the responses utilized by the animals were 
observably different under the conditions of swimming and 
running. 

Other cases suggesting topographical induction appear in 
various experiments commonly treated under the heading of 
transfer of training. Starch (1910) discovered that human 
subjects who were given practice in tracing the outline of a 
star-shaped figure, when the figure was seen only in a mirror 
and a partial reversal of one's customary movements was 
therefore required, showed a gradual improvement in per- 
formance which was not limited to the hand used during 
practice. In right-handed subjects, the left hand profited con- 
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siderably even when it had been given no training at all in 
the tracing task. Other investigators have confirmed these 
results and added information on the problem. Thus, Bray 
(1928), employing a technique in which subjects struck at a 
mirrored target, showed that the transfer effect was not 
merely bilateral (from the practiced to the unpracticed hand), 
but also extended to the homolateral foot (i.e., right hand to 
right foot). Such results remind us of Kellogg's findings with 
dogs, but they are probably not to be interpreted as simple 




FIG. 42. A mirror-drawing apparatus. The subject's hand is hidden 
from his direct view, but he can see it and the pattern to be traced in 
the mirror. The reflection, of course, reverses the actual movements to 
be made in tracing. (Courtesy of C. H. Stoelting Co.) 

cases of induction. Bray was able to demonstrate that other 
factors, such as a "transfer of methods/' were involved. 
Human beings tend to verbalize what they are doing, and any 
induction that exists between hand and foot in this kind of 
experiment may be mediated or secondary, rather than direct 
or primary. 

When we come to consider quantitative induction, the case 
becomes somewhat clearer. Take the property of response in- 
tensity or force. It has been demonstrated that a rat, even 
after long practice in bar-pressing, will continue to show vari- 
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ations in the amount of pressure exerted on successive occa- 
sions. In an experiment by Skinner (1938), rats were first 
conditioned in the usual manner and regularly reinforced 
during several training sessions, after which the response was 
extinguished. By connecting the bar with a pendulum and 
suitable recording apparatus, it was possible to obtain cumu- 
lative curves of response intensities during both the rein- 
forcement and non-reinforcement sessions. The slopes of 
these curves provided an indication of the average force of 
response as well as the degree to which the rats deviated from 
this average under the two experimental conditions. It 
turned out that, under regular reinforcement, the average 
force amounted to 35-40 grams, although many of the re- 
sponses were weaker or stronger than this. (A ten-gram force 
was required to depress the bar, so practically all pressings 
were strong enough to be reinforced.) Under extinction, 
there was at first a tendency for the animals to respond with 
more than the average force during conditioning, but this 
was followed by a marked decrease as the process neared com- 
pletion. 

Data on the distribution of forces in such a situation have 
been collected by Hays and Woodbury (reported by Hull, 
*943) i n the Yale laboratory. They reinforced bar-pressing 
with food whenever the response intensity amounted to 21 
grams or more, until 100 reinforcements had been given. The 
frequency with which different intensities appeared, in the 
case of one of their rats, is shown in the second column of 
Table VII. The third column of this table shows the values 
obtained with the same animal when a minimal force of 38 
grams was needed. 

It is obvious from these data that complete stereotypy in 
the force or intensity of the bar-pressing response is not 
achieved under these conditions. The reinforcement of a 
response having a certain intensity apparently suffices to 
strengthen topographically similar responses having widely 
different intensities. 
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Table VII 

THE DISTRIBUTION OF RESPONSE INTENSITIES IN THE HAYS-WOODBURY 

EXPERIMENT, UNDER Two CONDITIONS OF REINFORCEMENT 

(Data from Hull, 1943, p. 305) 



Response Intensity 


Intensity Required for Reinforcement 


21 Grams 


38 Grams 


13-16 Grams 
17-20 " 


3* 
4* 
*3t 

20 

37 
16 

12 
6 
o 



o 
o 


o 
i 

2 
2 

4 

8 

9t 
39 
35 
11 

5 
3 


2124 " 


25-28 " . . 


2Q 92 " 


33-36 " 


37-40 " 


41-44 " 


45-48 


49-52 " 


53-56 " 

57- 



* No responses reinforced, f Nine responses reinforced. 
J Seven responses reinforced. 

This amount of variation should not be surprising. It is 
obvious in most of our own actions and has often been meas- 
ured. Table VIII gives sample results from an experiment of 
Thorndike's (1931) in which subjects, with eyes closed, were 
asked to "draw a four-inch line with one quick movement/' 
During successive sittings, a total of 3,000 lines was drawn by 
each subject, under the same experimental conditions and 
without knowledge of results, the only reinforcement being 
the approval of the experimenter for doing the work. On the 
first day, in which 192 responses were made, the range of line 
lengths for one subject was 4.1 to 5.7 inches; on the twelfth 
day (175 responses), the range was 4.5 to 6.2 inches. The dis- 
tribution of lengths on both these days is shown in the table. 
Aside from a slight shift upward in the average length of 
line drawn, which is no more than a daily variation, the two 
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distributions are essentially the same. If this response under- 
went any decrease of inductive effect, it must have done so 
before these periods of prolonged practice were instituted. 
While the spread of responses in session twelve appears less 
than that in session one, this does not indicate a day-to-day 
reduction with practice, since the intervening days show con- 
siderable fluctuation. 

Table VIII 

THE DISTRIBUTION OF LINE LENGTHS DRAWN BY A SUBJECT DURING 
Two PRACTICE SESSIONS IN ONE OF THORNDIKE'S EXPERIMENTS. 

(Data from Thorndike, 1931, pp. 8-9) 
Length of Line Experimental Session 

(inches) I XII 

4.10-4.19 2 O 

4.20-4.589 i o 

4-30-4-39 4 o 

4.40-4.49 3 o 

4-50-4-59 ll 3 

4.60-4.69 11 o 

4.70-4.79 21 6 

4.80-4.89 26 6 

4-90-4-99 3 *3 

5.00-5.09 20 . . 25 

6-10-5.19 22 27 

5.20-5.29 16 24 

5-30-5-39 i* 3 

5.40-5.49 7 17 

5-50-5-59 3 i* 

5.60-5.69 2 7 

5-70-5-79 3 

5.80-5.89 o o 

5-90-5-99 o i 

6.00-6.09 o o 

6.10-6.19 o o 

6.20-6.29 o i 



The Why of Variability 

One fact emerges plainly from the discussion in the two 
preceding sections. Under all conditions of reinforcement 
or non-reinforcement thus far described, some degree of 
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response variability survives complete stereotypy is never 
achieved. This appears in the Guthrie-Horton study no less 
than in the studies of Muenzinger. It is present, too, in all 
the cases of induction that we have cited. But we are still 
without an explanation of this variability. Even assuming 
that the strengthening of one response will strengthen other, 
slightly different ones, how does it happen that a specific re- 
sponse with one specific set of properties does not, by virtue 
of more frequent reinforcement, come to be the only re- 
sponse emitted in a given experimental situation? 

The answer to this question demands that we postpone for 
a while the consideration of the second major topic of this 
chapter, and turn our attention to a factor that we have hith- 
erto encountered only in its grosser aspects. Our side trip will 
not, however, be wasted, because we shall discover some mat- 
ters of considerable interest and significance. 

Variability and Negative Reinforcement 

Let us go back for a moment to the Guthrie-Horton ex- 
periment. Suppose that, in this study, one of their cats first 
operated the release-mechanism by falling backward upon 
the pole while trying to climb the near-by wall. The falling 
response would presumably be strengthened through its pos- 
itive reinforcement, but the fall might also provide stimula- 
tion of a negatively reinforcing sort. The consequences of the 
fall would thus exercise a depressive effect upon the immedi- 
ately preceding response, just as a mild electric shock might 
do. In line with this way of thinking, when the cat re-entered 
the situation on the next trial, he would not be as likely to 
reinstate the successful behavior as he would if some other 
action had brought about his escape from the box and get- 
ting of food. Some alternative mode of response, whether pre- 
viously rewarded or not, would be expected, and another 
solution might soon be forthcoming. 

Or, suppose that the cat's first positive reinforcement came 
from standing on his hind legs and stretching his body to 
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reach the top of the pole. These straining movements would 
also provide a negatively reinforcing state of affairs, albeit of 
a mild degree. With repeated trials, we would look for a more 
or less gradual reduction in the amount of energy expended 
in his response to the pole. In everyday language, we would 
say that the cat should learn to take the easiest, rather than 
the most awkward or tiresome way of reaching his objective. 
(It might be, too, that the awkward response would be a 
somewhat slower one and be at the additional disadvantage 
of having its reinforcement delayed.) 

There is not much evidence in the Guthrie-Horton ac- 
count, or in Muenzinger's report of the behavior of his 
guinea pigs, to suggest the operation of this factor in these 
experimental situations, but Thorndike has given us some 
dramatic instances. He taught both dogs and cats to lick or 
scratch themselves in order to escape from the problem box, 
and found that these operants decreased in magnitude under 
regular reinforcement until they were mere vestiges of the 
original. Thus, licking might be reduced to a mere jerk of 
the head downward. Similarly, Lorge (1936) trained rats to 
make either a face-washing, standing-up, 'begging/ or scratch- 
ing movement, reinforcement being escape from a problem 
box to food. He noted a "short-circuiting" of these responses 
with successive trials. "The responses became more perfunc- 
tory and stereotyped. The 'face-wash* changed from a vigor- 
ous wash to a rapid movement of both forelegs to the face; 
the 'scratch* changed to a rapid flexion of the hind-leg to the 
flank, only remotely reminiscent of the first response to ir- 
ritation." When individual rats were trained in all three re- 
sponses of face-washing, standing-up, and 'begging/ they 
tended gradually to eliminate the first two responses entirely, 
securing reinforcement by making a "perfunctory" beg. Re- 
lated observations were made when other rats were condi- 
tioned to touch any one of four differentially accessible 
projections in their problem box with either forepaw. The 
animals ended by responding most frequently to the projec- 
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tion that required the least movement and was closest to the 
door through which reinforcement could be reached. The 
last-mentioned experiment is remindful of an earlier one by 
Gengerelli (1933), who trained ring-tail monkeys to depress 
two (or four) levers in sequence for escape-to-food. He ob- 
served that his subjects passed through successive stages of 
(i) "over-exertion," in which they often climbed up on the 
levers; (2) two-handed clasping and tugging at the levers; (3) 
a one-handed grasp-slap combination of movement; (4) a 
mere downward slap; (5) an ineffectual 'pass* at the lever; and 
(6) even cases in which the animals "would rush up to the 
lever, then precipitately turn to run to the next one," with- 
out any lever contact. 

Snll another example, at the human level, comes from an 
experiment in which Thorndike (1931) required students, 
during a fourteen-day period, to make 3,360 word comple- 
tions of a long list of such word-beginnings as ab, af, bo, and 
the like. In some cases, the same beginning was repeated as 
many as twenty-eight times in the course of the experiment, 
and it was possible to note any change in the nature of the 
completion that occurred. Thorndike discovered a strong 
tendency for the subjects to reduce the length of the complete 
words as the amount of repetition of the word-beginning 
increased. Thus, one student, upon the first eight presenta- 
tions of el, responded with elephant five times, elevate twice, 
and elf once; upon the last eight presentations, he responded 
with elf exclusively. Presumably, the effect of very mild nega- 
tive reinforcements, in conjunction with the positive rein- 
forcement supplied by any adequate and rapid completion of 
the task, was sufficient to offset the influence of positive rein- 
forcement for the first-made "long" solutions. This appears 
to be related to the fact that the most frequently occurring 
words in the English language tend to be shorter than those 
appearing less often. It is also reflected in the various trunca- 
tions and substitutions observed when initially long words 
come into common use within one or another "verbal com- 
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munity." We speak of our car, our auto, or our bus instead of 
our automobile; we look after a strep throat; we send a wire 
or use the phone; and we present an S D rather than a discrim- 
inative stimulus. In addition, it may account for some of the 
slurring tendencies often observed in colloquial speech, as 
when errors become airs, flowers become flars, and borrow 
becomes bar. 

Negative Reinforcement and the "Law of Least Effort" 

Another line of evidence for the depressive effect of re- 
sponse-produced negative reinforcement comes from studies of 
alternation behavior in animals, especially the rat. Psycholo- 
gists have for years been interested in the fact that rats, when 
given a choice of two equally long routes to food, as in a single- 
unit T-maze, tend to avoid repetition of the last-made re- 
sponse. Thus, a run to the left or to the right is commonly 
followed on the next trial by a run in the opposite direction. 
Some researchers (e.g., Hunter, 1928) have even spoken of an 
'innate' or 'natural' tendency of the animals to behave in this 
manner. Solomon (1948) has recently reviewed the observa- 
tions in this area and, revising an earlier formulation by Hull 
(1943), has proposed an explanation that jibes well with the 
analysis that we have presented in the preceding section. In 
a choice situation, with conditioned responses of presumably 
equal strength, the emission of one response provides for its 
own temporary depression and thus paves the way for the 
emission of an alternative. Solomon (1946) found that greater 
"effort requirement in T-maze running is accompanied by a 
greater frequency of alternation behavior." 

One would not expect such a minute amount of response- 
produced negative reinforcement to be long-lasting in its 
depressive effect; and the shortness of effect is confirmed in 
an experiment by Heathers (1940). This investigator found 
that the amount of alternation in a single-unit T-maze de- 
creased as the time between successive runs in the maze 
increased. 
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Various researchers and theorists have treated observations 
of the sort described here as illustrations of a law of "least 
action," "minimal effort/' or "less work." Such a formulation 
is non-analytical and should properly be considered as a 
corollary of something more basic (Hull, 1943); but we are 
all familiar with the kind of behavior to which it refers. 
Although these are the more obvious instances, something of 
the same sort probably accompanies every response, because 
every response requires effort. 

Under certain circumstances, of course, a response will be 
maintained in considerable strength even when it seems to 
be inefficient and awkward. This situation occurs when rein- 
forcement has regularly been denied to any easier variations 
of the response or, as we shall see presently, when motivation 
is higher. Gilhousen (1931) showed that rats, after having 
been trained to jump from one small platform to another of 
a straight-away series leading to food, persisted in their jump- 
ing response much longer than was necessary after an unob- 
structed pathway was set up alongside of their platform route. 
"Doing it the hard way" is apparently as characteristic of rats 
as of human beings, and has the same explanation: the hard 
way has too often been the only reinforced way available. 
Unless negative reinforcement is extreme and is clearly con- 
tingent upon a specific mode of response, all of us waste our 
energies needlessly. Industrial psychologists and 'efficiency 
experts' are well aware of this, and much of their research is 
directed toward identifying less effortful ways of reacting 
which, in the course of daily occupational routine, would 
never be appreciated or adopted by the individual workers 
under observation. How many of us would discover, by our- 
selves, that one brand of typewriter may involve only three- 
fourths as much work in its operation as another? And how 
many "hunt-and-peck" typists would be ready to change their 
style, in spite of the fact that they may know of a better method? 
The degree of negative reinforcement that accumulates as a 
result of our hunt-and-peck procedure is not great enough to 
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act as a specific depressant of the responses that brought it 
about. We may, by ourselves, make notable progress: we re- 
duce somewhat the amount of unnecessary bodily movements; 
we no longer punch the keys with a force that makes our 
fingers sore; we may even become two-finger touch-typists, 
thus eliminating certain movements of the head and eyes. But 
seldom, if ever, do we attain the efficiency of a well-trained 
performer who has had the advantage of an experienced 
teacher. Like Gilhousen's rats, we persist in our energy- 
wasting ways, and our resistance to change is the stronger 
because the better way requires, at the outset, an extinction 
of old, and often very strong, responses as well as a condition- 
ing of new ones. Add to this the fact that reinforcement for 
the new way is probably not as great, at the beginning, as that 
provided by the old. It is no wonder that we cling to our own 
"jumping" responses. 

Resistance to the adoption of improved working methods, 
so often encountered by the applied psychologist, is, one must 
admit, not always due to the fact that less satisfactory methods 
have been more often reinforced or that new ones must be 
conditioned. Sometimes the organism has no choice in the 
matter. The standard keyboard of a typewriter is un- 
doubtedly inefficient in terms of energy expenditure, yet it 
is unlikely to be replaced by one that would provide a more 
equable distribution of effort for the fingers of each hand 
for commercial reasons that have nothing to do with the un- 
willingness of a typist to give up the old and take on the new- 
ways of behaving. 

Motivation and Variability 

The exclusive reinforcement of a single response variant 
apparently reduces the probability that other variants will 
occur, but there is still another factor that deserves attention. 
Under strong motivation, an increased stereotyping of re 
sponse may also be observed. In an experiment conducted by 
Elliott (1934), rats were permitted to reach food at the end of 
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any one of five short alleys that diverged from a common start- 
ing-point. The alleys were of equal length and, when hunger 
motivation was weak, the rats showed no marked preference 
for one alley over another. However, when hunger was in- 
creased, each animal tended to take one pathway to the exclu- 
sion of the rest. Moreover, this decrease in variability was 
irreversible: a return to conditions of weak motivation did 
not reduce the amount of fixation upon the alley chosen 
when the motivation was strong. 

Quantitative data are lacking for the effect of motivation 
upon the range of bar-pressing movements, but frequent 
observations suggest that a narrowing-down effect occurs. For 
example, when the response removes a noxious stimulus such 
as a strong light, a well-conditioned rat may be seen to main- 
tain a single, crouching posture close beside the bar during 
most of the experimental session, even during the faint light 
of the reinforcement periods, and with all his bar-pressing 
movements greatly restricted in their variety. Occasionally, 
he may depart from this routine, possibly because of the nega- 
tive reinforcement produced by prolonged muscular tension, 
but his behavior is, in general, strikingly machine-like in 
quality, and it seems to retain this character for some time 
after the motivating stimulus has been greatly reduced in its 
intensity. 

Everywhere about us we see human actions that appear to 
be at least roughly analogous to such laboratory phenomena 
as these. Heightened motivation seems to be one source of 
the routinizing and stereotyping that marks our daily habits; 
and we adhere to such patterns of conduct when the pressure 
no longer exists. Yet, there is probably no single example that 
we could give which would not also suggest the operation of 
more than one factor; and it would certainly be foolish at this 
time to ascribe to motivation alone all of the tenacity and 
case-hardening of human behavior that we may observe. 
What is needed now, more than anything else, is an expan- 
sion of research in this very important sphere. 
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Extinction, Reconditioning, and Variability 

A recent study by Antonitis (1950) sheds some light upon 
the way in which variability is altered during the regular 
reinforcement, extinction, and reconditioning of operant be- 
havior. Rats were permitted to run from a release box across 
an open area to a horizontal slot (50 cm. long) in the wall, 
twelve inches from, and facing, the release-box door. Inser- 
tion of the rat's nose into the slot at any point interrupted a 
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FIG. 43. Plan of Antonitis's apparatus for studying response vari- 
ability in the white rat. Photographs of each nose-insertion response to 
the slot allowed the investigator to take two measures of the response: 
the locus of the response along the slot, and the angle of approach to the 
slot. Both measures yielded information about the degree of response 
variability (or its converse, stereotypy); and both measures, as it turned 
out, told the same story. The labeled parts of the apparatus are as fol< 
lows: A electric counter; .# mirror tilted at a 45 angle to the base 
of the experimental cage; C reflected image of the 50-011. response slot; 
D clear plastic rear wall containing response slot; E white lines painted 
on black floor of cage; Fstarting-feeding compartment; G food tray; 
5 spotlight; P photoelectric cell. (After Antonitis, 1950.) 
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beam to a photo-electric cell, causing the animal to be pho- 
tographed in position, whereupon his return to the release 
box was reinforced with a pellet of food. Experimentation 
was begun after one operant-level session in which beam- 
breaking responses were recorded but not reinforced. Five 
daily conditioning sessions, in which a total of 225 reinforce- 
ments was given, were followed by two one-hour extinction 
sessions, one session of reconditioning (50 reinforcements), 
two more sessions of extinction, and a final day of recondi- 
tioning. Figure 44 shows the way in which the distribution 
of slot-response positions varied for one animal under these 
different procedures. During the first five days of condition- 
ing, there was a decrease in the variability of response posi- 
tions; in the next two days, during extinction, the variability 
increased; and this was followed by a marked decrease on the 
first day of reconditioning. This decrease after extinction was 
characteristic of all the rats and amounted to a greater degree 
of stereotypy than had been achieved during the five days of 
conditioning. It remained at approximately the same level in 
the final reconditioning session, after the third and fourth 
hours of extinction. A control group of animals, not subjected 
to the extinction periods but treated like the experimental 
animals in every other respect, showed no comparable in- 



LECEND FOR FIG. 44 

Daily frequencies of response positions along the 50-cm. slot for one of 
the experimental rats in Antonitis's study. The experimental sequence 
may be followed by taking the graphs from left to right along each row, 
starting with the top row. Notice: (a) the wide variability of the 
operant level responses; (b) the trend toward lessened variability during 
the fivj original conditioning days, when each nose-insertion response 
was reinforced by a food pellet provided in the tray of the starting- 
feeding compartment; (r) the reappearance of greater variability during 
extinction days i and 2 (as well as the smaller number of responses on 
extinction day 2 as against extinction day i); (d) the heightened stere- 
otypy on reconditioning day i; (e) the return of wide variability on 
extinction days 3 and 4; and (/) the extreme stereotypy on recondition- 
ing day 2. (After Antonitis, 1950.) 
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crease of stereotypy beyond the first five periods of con- 
ditioning. 

The Differentiation of Response 

Despite the degree of stereotypy that may be achieved 
through the influences just discussed, a complete mechaniza- 
tion of response is probably never reached. To the person 
who sets a high premium upon efficiency and precision of 
movement, this failure may represent a fundamental weak- 
ness of our biological function. From another, and wiser 
viewpoint, it is a boon, since it permits the development of 
adaptive behavior that otherwise might never have been 
included in an organism's repertory. 

The procedure by which we produce novel responses in an 
animal or a human being is simple to describe, although not 
at all easy to execute. In essence, it amounts to this: we select 
one (or more) of the "natural" variations of a well-condi- 
tioned response and give it exclusive reinforcement; the 
remaining variations are subjected to extinction. If we pick 
out, in advance, a variation that has been of fairly frequent 
occurrence, and if we apply this selective reinforcement 
rigorously, we can soon produce an increase in the frequency 
of the response that possesses the property or properties (for 
example, the force) that we have chosen. At the same time, 
we decrease the frequency of those responses that do not meet 
our specifications. 

Skinner (1938) has called this shift in frequency through 
selective reinforcement the differentiation of response, to 
distinguish it from the discrimination of stimuli, to which it 
is analogous; and he conducted several experiments on the 
differentiation of both force and duration of bar-pressing. It 
will be profitable to consider some of this work briefly, to see 
how the principle operates. In doing so, we shall limit our- 
selves to his observations on changes in force of response. We 
take, as our point of departure, the experiment mentioned 
earlier in this chapter under the heading of induction. 



RESPONSE VARIABILITY AND DIFFERENTIATION 187 

When the normal force of the bar-pressing response, and 
the range of forces, had been determined by Skinner for a 
given animal, selective reinforcement was supplied to above- 
average forces that had occurred only occasionally before 
The result was an almost immediate shift of response intensi. 
ties to a higher level. The successful force, which had pre- 
viously occurred about once in every ten pressings, now 
appeared as often as once in two pressings. In addition, the 
entire range of force values was shifted upward appreciably. 
Something like this may be seen in the Hays-Woodbury fig- 
ures of Table VII (page 172). The second column of this table, 
as mentioned above, shows the distribution of forces when 
there was a 21 -gram minimum requirement to obtain rein- 
forcement; the third column tells us what happened in the 
same animal when a new and greater force (38 gms.) was 
demanded. 

The shift in distribution of forces that takes place under 
such experimental conditions shows us the way in which we 
can set up a response that has a force not previously observed 
in the animal's bar-pressing. If, now, we selectively reinforce 
some of the more intense responses within the new range, we 
may shift the distribution again; and we may continue in this 
fashion until our animal is exercising his greatest possible 
force upon the bar. Skinner was able, through this procedure, 
to reach a force-level of one hundred grams, nearly one-half 
of the animal's bodily weight. This could only be achieved, 
of course, when the force required at a new level had been of 
sufficiently frequent occurrence at the level preceding; and 
even then the new level was reached very gradually by the 
animal. The entire procedure bears a marked resemblance to 
that described in Chapter 4 for the establishment of high 
response rates under periodic reinforcement at a fixed ratio. 
Indeed, we would have been justified in treating fixed-ratio 
P-R in the present context as a rate differentiation. 

Since force differentiation, in our example, had to be estab- 
lished in the face of the negatively reinforcing effect of the 
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extreme effort involved, setbacks in progress were quite com- 
mon. This was especially the case at the beginning of a day's 
experimental session, when considerable 'warming up' was 
required before the rat reached a level where practically all 
of his responses produced food. As with the licking response 
of Thorndike's cat, the animal tended to revert to an easier 
way of behaving from which he had to be 'lifted* on each 
successive day of work. 

The warming-up effect observed in differentiated bar- 
pressing has its parallel in various human activities. Coaches 
of athletics have long recognized the value of a limbering-up 
process in such sports as baseball, track, and crew-racing, and 
this apparently holds for any performance that requires 
above-normal exertion. Even such small-muscle activities as 
tapping with the fingers (Wells, 1908) or repeating the alpha- 
bet backward (Robinson and Heron, 1924) show similar 
changes during practice sessions in which a high rate of 
responding is required. 

A study of force-differentiation in a situation comparable 
to the one used with rats has been made by Murphy (1943) 
with human subjects. This investigator employed a modifica- 
tion of the well-known "pin-ball" machine in which a small 
plunger was pulled back against a spring and released to send 
a twenty-gram marble vip a slight incline into a hole. The 
hole, as well as the course taken by the marble, was hidden 
from the subject's view, but a flash of light at the front of the 
machine indicated the correctness of the response. This light- 
flash was under the control of the experimenter, who could 
make it contingent upon any degree of plunger-pull or who 
could eliminate it entirely if he wished. The distance of each 
plunger movement was recorded vertically and cumulatively 
on the paper-covered surface of a kymograph drum that 
jumped ahead horizontally about one-eighteenth of an inch 
with every completed response. Rate of responding was re- 
corded on a separate drum in a manner similar to that used 
with rats. 
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Murphy explored (i) the 'normal force* and 'normal rate' 
of plunger-pulling when all responses were followed by a 
light-flash; (2) the effect of periodic reinforcement upon re- 
sponse rate and upon the subject's resistance to "shifting" 
from one required response-magnitude to another; and 
(3) the difficulty of setting up differentiations when different 
"margins of error" were permitted. His results were in gen- 
eral agreement with those obtained from rats in the bar-press- 
ing studies, but were different in certain details. For example, 
the distribution of forces when all responses produced flashes 
of light bears little resemblance to that obtained by Hays and 
Woodbury (see Table VII). This is presumably due to the 
fact, recognized by Murphy, that there were certain differ- 
ences in the procedure employed in the two studies. In terms 
of the organism's capacity, the range of response forces was 
less in this than in the experiment using rats, the practice 
periods were probably not of comparable duration, the form 
of reinforcement was different, and so on. 

Murphy's study did not involve the reinforcement of all 
responses above a certain force-level, but specified an upper, 
as well as a lower, limit of magnitudes. This requirement 
brings his procedure somewhat closer to the everyday type of 
human differentiation where the range of response properties 
must not exceed a fairly narrow band of tolerance if our 
adjustment is to be effective. The results indicate that (i) as 
the experimenter restricts the tolerated range the number of 
incorrect responses required before a correct one is made 
grows larger; (2) there is an apparent improvement in the 
ability of subjects to shift back and forth from one response 
range to another as the number of such shifts increased; and 
(3) any form of periodic reinforcement is more effective than 
regular reinforcement in building up a resistance to shifting 
from one force-level to another. Such findings are clearly 
consistent with those obtained from studies with rats, and 
point to the fruitfulness of extending this area of research. 

The differentiation of force is, of course, but one form of 
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change that may result from the selective reinforcement of 
response. Topographical differentiation is just as common 
and is considerably more dramatic, since it may lead to ac- 
tions that are more clearly 'new.' Although few experimental 
studies have been aimed specifically at this problem, it is 
obviously present, along with stimulus discrimination, in 
many forms of training, and it is readily demonstrable at a 
non-quantitative level. A rat, for example, may be led, by 
gradual stages, to walk on his hind legs, jump through a loop, 
carry a marble in his mouth, or perform some other action 
that would never have occurred in a given experimental set- 
ting without the application of a differentiative procedure. 
The technique is always the same: some one variant of an 
already conditioned response is selectively reinforced until it 
becomes more frequent than the others; when this is achieved, 
a variant of the new response is treated in the same way. 
Through a series of successive approximations to the desired 
reaction, the behavior is altered until it comes to bear little 
or no resemblance to the first-conditioned form. 

When we think of differentiation, we perhaps think first of 
animal stunts or athletic skills, but this is much too narrow 
an application. Teachers of speech or of foreign languages, 
experts in time-and-motion study, instructors in the various 
arts and crafts all these, and many others, are regularly 
engaged in the formation of differentiated responses. Indeed, 
there is hardly a human pursuit which, in one aspect or an- 
other, does not illustrate the process in some degree. 

The accuracy or precision of a differentiated operant is 
ordinarily, perhaps always, a function of the discriminative 
capacity of the person who supplies the reinforcement. The 
teacher of whatever art or skill must be one who can distin- 
guish minute changes in the behavior of his pupil. He must 
be able to single out that variation of response which is in the 
direction of the improvement he desires, and he must do this 
rapidly if he is to make his reinforcement optimally effective. 
Regardless of his own differentiative ability, he must be able 
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to detect those shades and nuances, those slight deviations and 
minor refinements in the actions of others which must be 
strengthened or extinguished when they appear. 

Resume 

A word of recapitulation at this point may not be out of 
order. We have seen, in the present chapter, that organisms 
characteristically show response variability even in the ab- 
sence of detectable environmental change. A degree of 
stereotypy will result from regular reinforcement, but this 
stereotypy is never complete. We have also seen that the rein- 
forcement of one specific response probably strengthens 
others (response induction). It has been suggested that the 
source of variability lies in the fact that even positively rein- 
forced responses are subject to momentary depressions in 
strength through the negatively reinforcing consequences of 
their emission however faint and fleeting these consequences 
may be. Such depressions pave the way for the emission of 
alternative responses, some of which may also obtain positive 
reinforcement and may be even less freighted with negative, 
so that they take priority in terms of later occurrence. This 
happens unless there are strong counter-influences that is. 
unless the first response has been the only reinforced one or, 
perhaps, has been established under stronger motivation. 
This explanation of variability may be related to * 'alterna- 
tion" studies and to commonly cited examples of "least effort," 
but what we need most is further research in this field. With 
respect to differentiation, there is less room for uncertainty 
than in the discussion of variability. Given a degree of varia- 
bility, however accounted for, we have no trouble in modify- 
ing behavior independently of changes in exteroceptive 
stimulation. A marked novelty of response may even be 
achieved simply by reinforcing successive approximations to 
a desired mode of reaction and withholding reinforcement 
for responses that do not change in this direction. In this 
effect we have the basis for developing various skills. 
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Differentiation and Discrimination 

We began our discussion of operant behavior, in Chapter 
3, without consideration of the stimuli to which such be- 
havior might be related. Later, in Chapter 5, we showed that 
operants may come to be emitted in the presence of some 
stimuli and not in the presence of others. In this chapter, we 
have argued that responses are never completely stereotyped, 
and that they may be changed in their character independ- 
ently of any change in stimulus conditions. Our examples of 
response differentiation were chosen with an eye to pointing 
up this independence. Yet it must have occurred to you that 
many, if not all, differentiated operants are also discrimina- 
tive operants. Very early in an organism's behavioral develop- 
ment a specific mode of response becomes related to a more or 
less identifiable form of stimulation. Obviously, a rat is un- 
likely to engage in bar-pressing movements in the absence of a 
bar that can be seen or touched, just as a human subject in a 
reaction-time experiment is unlikely to make a key-tapping 
movement when no reaction-key is present. In many cases, 
of course, the relation of response to stimulus may not be so 
apparent, as when a rat is reinforced for standing on his hind 
legs or a student has been reinforced for raising his hand in 
class, but even in such instances a correlation may often be 
discovered; a rat does not emit his standing response when he 
is no longer in the experimental situation, and a student does 
not ordinarily raise his hand in the absence of his instructor. 

A simple example of the tie-up between discrimination and 
differentiation is the behavior of an animal in a T-box (page 
137). In the presence of one S D , he turns to the right; in the 
presence of another, he turns to the left. Similarly, in a reac- 
tion-time study, a person may respond with one hand to a 
green light and with the other to a red one. At a slightly more 
complex level, we have the subject's behavior in a situation 
where the specific stimuli and responses are greater in num- 
ber, as in Merkel's experiment (page 143) which employed 
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as many as ten alternatives. This situation, in turn, resembles 
the one in which students master International Morse Code, 
where no fewer than thirty-six basic stimulus patterns are 
each connected with a differentiated response pattern. When 
one learns to receive code, his problem is mainly discrimina- 
tive, since the written or spoken responses have already been 
well differentiated; in sending code, however, the problem is 
one of differentiation, since the discriminative work was done 
when the student learned to read his ABC's. 

The early development of a child's vocabulary provides 
many excellent illustrations of the combined processes of 
discrimination and differentiation. The vocalization of the 
child contains, at the beginning, a small assortment of basic 
sound elements (phonemes). With the maturation of the 
vocal mechanism, the number of these elements increases 
and, as they group themselves into patterns by virtue of the 
principle of "chaining" (see Chapter 7), they are also sub- 
jected to differentiation through the selective reinforcement 
supplied by parents and others. At the same time, the emis- 
sion of many of these patterns comes to be connected with 
relatively specific stimulus situations. Mama comes to be 
emitted mainly in the presence of the mother; dada may be 
adopted by the father as his name, and so on. At first, the 
reinforcement for this naming behavior is given lavishly, no 
great discriminative acuity or differentiative precision being 
required of the child. Later, as the educational process gets 
under way, reinforcement is more sparingly doled out, being 
given only when highly specific responses are emitted in 
equally specific stimulus situations. In fact, one might say that 
the educational process itself is largely a matter of establishing 
connections between well-discriminated stimuli and well- 
differentiated responses. 

In the differentiation of verbal responses, a frequently 
utilized training procedure is that in which reinforcement is 
made contingent upon the resemblance of a response to an 
S D that is itself the verbal response of another person. We see 
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this whenever a child utters a word in imitation of one 
emitted by a parent or teacher. The importance of this S D 
can hardly be appreciated by one who has never observed the 
difficulty with which totally deaf children learn to speak 
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FIG. 46. Actual records of the formation of a differentiated response 
chain by two students learning to send a Morse Code signal. On each 
trial the student tried to imitate with his own telegraph key the signal as 
sounded for him by a professional. The figure shows only the first 12 
attempts by these students, who were without any prior experience in 
receiving or sending the code, but who rapidly achieved an acceptable 
signal. (Courtesy of M. P. Wilson.) 

intelligibly. The very nature of their defect prevents such 
children from matching their own vocal productions with 
heard models. A substitute, for the deaf, is provided by vibra- 
tory or movement cues given to the child when his hand is 
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pressed to the jaw region of a speaking person. This stimula- 
tion may be matched to some extent by the vibrations aroused 
in the child's own speech apparatus during vocalization; but 
the match is far from perfect and other cues may also be em- 
ployedfor example, a visual matching of lip-movements 
through the child's use of a mirror. Even then, however, the 
reinforcement provided by the teacher's approval of the 
approximated sound-pattern remains, for these unfortunates 
the basic factor in the educative process. 

Training in speech differentiation has a close parallel in 
the procedure through which radio operators learn to trans- 
mit code signals. Here, too, a well-accepted teaching device 
is that in which the student attempts to reproduce a model 
signal transmitted by the instructor. In the absence of this 
auditory S D , progress is seriously retarded, and even a highly 
trained operator may find his task upsetting when no tone- 
patterns or associated clickings result from his movements. 
Although experimental evidence is lacking, there is every 
reason to believe that, under such circumstances, the intel- 
ligibility of his transmission suffers appreciably. As compared 
with speech differentiation, of course, this skill requires very 
little precision of movement, but the fundamental process 
appears to be the same. If adequate responses are to be estab- 
lished or maintained in the absence of heard models, the 
presentation or withdrawal of reinforcement becomes all- 
important. 

NOTES 

Muenzinger's observations of variability in the lever-pressing response 
of guinea pigs were confirmed in a later study by Muenzinger, Koerner, 
and Irey (1929). Using a lever the cross-bar of which was removed to 
eliminate two-paw responses, they noted the persistence of variability 
in a single mode of response during 600 solutions of the problem. Three 
pigs were given "guided" training, with only right-paw responses rein- 
forced, and three were given "unguided" training, any effective response 
being reinforced. As you might expect, other than right-paw movements 
decreased during guided training, but the amount and change of right- 
paw patterns was about the same for the two groups. One important 
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difference appeared: there was, in the guided group, a greater develop- 
ment of "accessory" movements useless responses that accompanied or 
preceded the effective ones. Thus, a head movement might accompany 
lever pressing with the paw, or a series of light taps might precede a 
strong downward push. This unnecessary behavior is attributed to the 
greater precision of response required from the guided group, and is 
likened to the grimaces and tongue-twistings of a child when learning 
to write. Similar observations have, indeed, been reported of human 
behavior in learning situations where the tasks were of considerable 
difficulty, and it has been noted that the amount of such accessory re- 
sponding decreases as the task nears completion (Stroucl, 1931; Ghiselli, 
1986). 

Entertaining items, suggestive of response induction, often turn up 
in unexpected places. There is, for example, the tale of the British 
surgeon who trained his right-handed pupils to perform operations 
with the left hand, and who is reported to have said: "Train the left 
and the right will look after itself." And there is also the report that an 
increase in the muscular development of one arm, as a result of special 
exercise, will lead to an increased development of the other. Before 
interpreting this to mean a double return for work done, we should 
remember that, according to the induction principle, the exercise itself 
was not restricted to one arm alone! 

Hull's (1943) conception of the way in which a response may produce 
its own momentary depression in strength is attributed by him to two 
of his former pupils, Mowrer and Miller. Mowrer's version of the idea 
is found in a collaborative study with Jones (1943); Miller's is presented 
in a book, Social learning and imitation (1941), of which Dollard was 
co-author. The Miller-Dollard book is a good companion-piece to the 
present text, being one of the early approaches to the viewpoint here 
outlined. The Mowrer-Jones study dealt with resistance to extinction 
as a function of the force required of rats in depressing a bar. Their 
findings point to an inverse relation between the variables the greater 
the force the fewer the responses during extinction. Solomon's (1948) 
review of the entire problem contains much more material than we 
have cited. For example, he connects the principle with studies of work 
and fatigue, extinction and spontaneous recovery, discrimination (by 
rats) of pathway distances and inclinations, psychophysical judgments, 
and the tendency of human beings to avoid the repetition of identical 
guesses. 

Researchers and theorists whose names have often been connected 
with the 'law of least effort' are Wheeler (1929), Gengerelli (1930), 
Tsai (1932), Lorge (1936), Waters (1937), Crutchfield (1939), and Zipf 
(1948). Hull's interesting treatment of the concept is to be found in 
his Principles of behavior (1943). Some telling criticisms of 'least effort* 
as a basic law may also be found in Guthrie's Psychology of learning, 
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IN LEARNING [the Lord's Prayer] we repeat it; that is we pronounce the 
words in successive order, from the beginning to the end. The order 
of the sensations is successive. When we proceed to repeat the passage, 
the ideas of the words also arise in succession, the preceding always 
suggesting the succeeding, and no other. Our suggests Father, Father 
suggests which, which suggests art; and so on, to the end. How remark- 
ably this is the case, any one may convince himself by trying to repeat 
backwards, even a passage with which he is as familiar as the Lord's 
Prayer. 

James Mill, Analysis of the Phenomena of 
the Human Mind, 1829 

The Principle of Chaining 

The quotation above, from a famous British thinker, gives 
us a ready-made introduction to the central theme of this 
chapter. When stripped of its subjectivity, it constitutes a 
clear recognition of the fact that responses commonly occur 
in series rather than as isolated behavioral units. Stated in 
terms with which you are more familiar, it amounts to this: 
one response commonly produces the stimulus for another. 

All along in the preceding pages of this book, we have 
treated the single response or the single stimulus-response 
relation as the principal topic with which psychology is con- 
cerned. Except for a brief consideration of compound stimuli 
in Chapter 5, we have been careful to postpone the discussion 
of more complicated matters until the simple ones had been 
made as clear as we could make them. This has been a nec- 
essary approach and the one employed in all scientific exposi- 
tion, but it may have occurred to you that our treatment was 
incomplete that we failed to recognize the fact that one 
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stimulus-response relation is seldom isolated completely from 
those which precede or follow it. Such a criticism is justified 
although, in carrying out our scheme of presentation, it 
would have been premature to discuss the matter before we 
reached this chapter. 

If you were to describe in detail the behavior of a well- 
conditioned white rat when placed in the bar-pressing ap- 
paratus for a period of regular reinforcement, you might 
come out with something like this: 

He ran immediately to the front of the box where he came to a stop 
in a position facing the bar; then he raised himself on his hind legs 
and put his forepaws on the bar; with paws in place, he gave a quick 
downward thrust which depressed the bar sufficiently to activate the 
food-magazine and discharge a pellet of food into the tray below; then 
he lowered himself to the tray and seized the pellet with his teeth and 
paws, after which he settled back on his haunches and began eating 
the pellet. 

In this fairly accurate description, there are at least six dis- 
tinct reflexes upon which good observers might be expected 
to agree. Without going into an unwieldy degree of specifica- 
tion, we may list them as follows: 

Reflex Stimulus Response 

5 Bar-location Approach 

4 Visual bar Rising 

3 Tactual bar Pressing 

2 Apparatus noise Lowering 

i Visual pellet Seizing 

o Pellet-in-mouth Chewing 

The discriminative stimuli for these responses are not as 
easily identified as the responses themselves. This is especially 
the case for the end reflexes in the series (bar-location and 
pellet-in-mouth are rather ill-defined stimuli), but it holds 
also for reflex i (the pellet might be touched or srnelled rather 
than seen) and, to a lesser degree, for all the rest. It must be 
remembered that we are dealing with a series of operants the 
stimulus control of which is never as strictly or clearly defined 
as when we deal with respondents. Control may, of course, 
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be established we have shown this in Chapter 5- and we are 
safely conservative in assuming the operation of S D 's in the 
present situation, but a highly specific designation of stimuli 
can hardly be made in the absence of suitable tests. 

The responses of our list are not exactly equal in status. 
The approach response of reflex 5 will not often occur as a 
member of the series under conditions of regular reinforce- 
ment. After his first bar-pressing response, the rat is in posi- 
tion for the next response and will usually remain so during 
most of an experimental session. When the food has been 
swallowed, he will usually raise himself to the bar imme- 
diately and press again. We might even expect that the inges- 
tion of food itself would come to serve as an S D for the 
reinstatement of the rising response. The other responses of 
the series are, of necessity, always present and may easily be 
observed with a single exception: the chewing response may 
occasionally be absent (as when the rat loses his pellet) or 
difficult to detect (as when soft food is used and eating sounds 
cannot be heard). 

In addition to these six reflexes, we could have suggested 
more. Reflexes of ingestion are known to follow the chewing 
response; and "approach" is made up of several reflexes rather 
than one. If we attempted to specify them all, it would be a 
difficult technical task and no purpose would be served here. 
In fact, for our present purposes, it will be simpler and 
equally legitimate to move in the opposite direction and to 
consider the following list as representative. 

Reflex Stimulus Response 

4 Visual bar Rising 

3 Tactual bar Pressing 

2 Apparatus noise Lowering 

i Visual pellet Seizing 

This reduction leaves us with a clearly observable and reg- 
ularly recurring sequence or chain of responses, with stimuli 
that are effective under ordinary experimental conditions. 
Yet we know that a visual bar is not a necessity, since a dark- 
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ened box does not eliminate bar-pressing; similarly, a visual 
pellet is not essential to eating behavior, because a blind rat 
would also eat. We know, too, that, in addition to noise made 
by the apparatus, such as the click of the food-magazine, a 
discriminative cue may be provided by the sound of the pellet 
as it drops into the food-tray; and we know that all of these 
auditory stimuli may be eliminated or reduced in intensity 
without disrupting the behavior sequence appreciably. Com- 
pound S D 's are present at every stage of response; and these 
S D 's may be exteroceptive, originating outside of the organ- 
ism, or proprioceptive, arising directly from the muscular 
movements themselves, that is, inside the organism. The 
important fact is that each response is undoubtedly depend- 
ent upon some form of discriminative stimulation. 

The paradigm shown below may help you to clarify the 
operation of our reduced chain. This paradigm may be read 
as follows: 



S D * (the visual bar) leads to R 4 (the rising response). This is 
followed by S D3 (the touch of the bar on the paws), which 
leads to R 8 (pressing). The pressing is followed by S D2 (ap- 
paratus noise), which leads to R 2 (the lowering response); and 
this response is followed by S D1 (sight of the pellet), which 
leads to the seizing response, R lf which initiates eating. 

Chains and New Units 

Here, then, we have a chain of reflexes in which each re- 
sponse produces the stimulus for the next. A striking aspect 
of such a chain is the over-all smoothness of transition from 
one link to another. Indeed, a well-conditioned rat gives the 
appearance of making, not four responses, but one. There are 
no pauses, no hitches, no jerks in the sequence. One response 
seems to flow easily and rapidly into the next. It is not hard 
to understand why an observer of the animal should speak of 
a single act of bar-pressing in a field of stimulation. It is easy 
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to overlook the genesis of the behavior from a series of dis- 
crete units linked into a continuous and efficient performance. 

The multi-membered nature of the act can be demon- 
strated, however, even after strong conditioning. Suppose, 
after considerable training, we permit an animal to press the 
bar and produce the apparatus noise, but we withhold the 
food. This is the usual extinction procedure, and you know 
what kind of extinction curve to expect. In terms of our 
chaining paradigm, we have broken the sequence at a point 
between R 2 and S D1 . The first three reflexes occur with de- 
creasing frequency until a very low level of strength is 
reached. But what happens to the fourth reflex, the pellet- 
seizing? Has it been extinguished too? You can guess the 
answer: it has not. The rat will still seize any pellet that is 
placed before him. And this suggests that at least the final 
link in our chain is functionally separable from anything that 
preceded it. 

But what about the behavior that preceded the pellet-seiz- 
ing; may it not be a natural totality rather than a chain? 
Again we can put the matter to test. Suppose we begin extinc- 
tion by permitting the bar-pressing response (R 3 ) to occur, 
but not permitting either the apparatus noise or the food to 
follow. Now, in terms of our paradigm, we have broken the 
chain at a point between R 3 and S D2 , one link in advance of 
the former break. Suppose, further, that we carry out this 
extinction until very few pressing responses are being emitted 
by our rat. When this stage is reached, let us reintroduce the 
apparatus noise but continue to withhold food reinforcement. 
Now, when the animal raises himself to the bar and depresses 
it, he is stimulated by the noise which was formerly a part of 
the S D compound that led him to drop down to the tray. 
What will be the effect of this stimulation upon his subse- 
quent behavior? The question has been answered experi- 
mentally. Following the reintroduction of the apparatus 
noise, the almost-extinguished bar-pressing will immediately 
recover strength. That is to say, the bar-pressing is recondi- 
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tioned for a time and then yields a new extinction curve. This 
is shown in Figure 47, where the arrow indicates the point at 
which the apparatus noise was reinstated. 
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FIG. 47. The separate extinction of chained reflexes. Extinction of 
bar-pressing was begun in the absence of the S D (click) for tray-approach. 
At the arrow, each bar-press was again permitted to produce the click, 
with a resultant spurt in responding. (After Skinner, 1938.) 

Two significant facts emerge from this result. First, the 
mere production of the previously absent S D strengthens the 
responses that came before it in the bar-pressing sequence. 
Why it has this effect is a question to which the following 
chapter addresses itself, so it need not be answered here, but 
there is no denying that it does act in such a way. The appara- 
tus noise, under these experimental conditions, clearly serves 
to reinforce the behavior that, earlier, had been practically 
extinguished; and it does so in spite of the fact that it is not, 
during extinction, followed by food reinforcement. Secondly, 
the near-extinction of the first two members of the chain 
apparently did not affect the third. This answers our question 
about the totality of the behavior that preceded the pellet- 
seizing reflex. Whatever the oneness that this behavior ex- 
hibited, its chained origin is inescapable. The behavior is 
made up of elements that are by no means arbitrary and 
imdemonstrable. We can single out these elements if we try. 

It would, of course, be wrong to say that a closely-bound 
chain of responses does not come to function as a unit. When 
we break a chain at some point, all of the responses emitted 
up to that point decline in strength together in an orderly 
fashion. It is this very orderliness that justifies our speaking 
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of a response when the entire chain is emitted although 
perhaps it would be less confusing if we spoke of the chain 
as an act composed of several responses. Moreover, when 
viewed by an observer, the elementary components of the act 
ripple off in rapid succession and with considerable stereo* 
typy on almost every occasion of their emission. 

Ordinarily, in such a chain as that involved in bar-press- 
ing, we record the occurrence of only one movement the 
actual depression of the bar but there is no reason why we 
could not measure the frequency of others. We might, for 
example, record the approaches to the food-tray which follow 
the pressings, or the risings to the bar before the pressings. 
By such means we could study in more detail the building-up 
and breaking-down of the bar-pressing chain. Further inves- 
tigation will undoubtedly employ such procedures; indeed, 
a number of exploratory attempts have already been made 
(e.g., Arnold, 1947). 

An interesting variation in the technique of chaining is 
shown in an experiment by Gellermann (1931). This inves- 
tigator found that monkeys, after many practice trials, could 
solve a double-alternation problem in which they had to lift 
the lids of two adjacent boxes, one at the left (L) and one at 
the right (R), in a sequence LLRRLLRR, for a food reward 
that appeared after each response of the series. When this 
chain was formed, the monkeys displayed an ability to extend 
the alternation of paired responses well beyond the point at 
which the series was ordinarily terminated. In a similar ex- 
periment, Schlosberg and Katz (1943) demonstrated that 
white rats can learn to alternate between two upward and 
two downward pushes of a bar when food is given at the end 
of a four-membered series. These rats, like the monkeys in 
Gellermann's experiment, were able to make many successive 
double alternations. 

Relevant observations, although not described in detail, 
have apparently been made by the psychiatrist, Masserman 
(1946). He taught cats to obtain food by pressing a pedaJ 
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three times in succession before running to a food-box. When 
these "threes" were well established, the procedure was 
changed to permit a single pressing to secure reinforcement. 
Later, when this new manner of responding had replaced the 
first-conditioned threes, it was subjected to extinction. As the 
"singles" fell off in frequency, the cats regressed (see page 81) 
to the threes. Such observations are interesting and provoca- 
tive, and they should be checked with follow-up studies. 

In the field of verbal behavior, it would seem that nothing 
is more obvious than the unitary character of chains. Nearly 
every word on this page is an apparent example of a unit 
which was originally no more than a vocalized series of un- 
related and disjointed elements. What is the spoken word 
example itself but a temporal pattern of sounds that were at 
first distinct and isolated but are now so well chained as to 
be practically irreducible except under the special conditions 
of training in analysis? And this is not all; larger groupings 
than individual words may act in unitary fashion. For ex- 
ample, inasmuch as, in fact, and so forth all possess this 
character. Still larger groupings may betray the same cohe- 
sion, although to a lesser degree. Complete the following: 

bread and ; salt and ; time and ; hue 

and Or these: as quick as a ; as sly as a ; 

as strong as an ; as wise as an ; as quiet as a 

For most of us, the missing link in these oft-recurring 

chains is quickly added, although the completion will not be 
the same for every person. A moment's thought will suggest 
other, and perhaps better, examples. 

The problem of the functional unit of behavior, whether 
Verbal or non-verbal, is one that has not yet received the 
experimental attention it deserves. It would appear that there 
are many chains of behavior the units of which are themselves 
chains of still more basic units that have been welded together 
by the influence of reiniorcing agents. Table IX gives data 
from a study by McCarthy (1930) on the relation between the 
age of pre-school children and the average number of words 
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used in fifty sentences emitted in a simple observational 
situation. The figures in this table, which show a regular 
increase in sentence length with increased age, are typical of 
results obtained by other investigators for these age-levels. In 
addition, Davis (1937) has shown that the length of spoken 
sentences continues to increase with age until, at nine and a 
half years, the average sentence contains about seven words. 
When written compositions are measured (Heider and 
Heider, 1940), there is evidence for an increase from about 
ten words per sentence at the eight-year-old level to about 
fourteen words at the fourteen-year-old level. All this, of 
course, is exactly what we would expect if new units of verbal 
behavior were developed on the basis of the chaining of 
smaller units. In fact, it is possible that the number of func- 
tional units employed at different ages does not change very 
much. A word, at an early age, may be a single unit, whereas 
at a later age, it may be merely a part of a single unit. 

Table IX 

THE AVERAGE LENGTH OF SENTENCES IN SPOKEN 

LANGUAGE AS A FUNCTION OF AGE. 

(From McCarthy, 1930) 

Age in Years: 1.5 2 2.5 3 3.5 4 4.5 

Number of Words: 1.2 1.8 3.1 3.4 4.3 4.4 4.6 

Homogeneous and Heterogeneous Chains 

Some writers have distinguished between two kinds of 
chaining (e.g., Arnold, 1947). In one, a given stimulus is 
followed by a specific response which produces another 
stimulus which is, in turn, followed by another response, and 
so on. This type, in which the successive links are different, 
is called heterogeneous chaining. The bar-pressing chain is 
of such a sort, and so is the chain involved in saying "All men 
are created equal." On the other hand, we can imagine a 
chain composed of stimuli and responses which are prac- 
tically identical in each successive link. This has been called 
homogeneous chaining. The triple pedal-pressing of Masser- 
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man's cats approximates such a situation, as does the "rah- 
rah-rah" of a college cheer. 

Cases of pure homogeneity in chaining are rare; strictly 
speaking, they probably do not exist. Even the "rahs" of a 
cheer may be discriminably different to the listener in some 
aspect, say their force; and the successive responses in triple 
pedal-pressing may differ slightly in one or more of their 
properties. Moreover, in many instances, the homogeneous 
links are clearly bounded by heterogeneous ones in the com- 
plete chain. Thus, prior to the first of three pedal-pressings is 
an approach response, and after the third there is an advance 
to the food-tray. At the verbal level, the rarity of homo- 
geneous links is reflected by the scarcity or complete absence 
of words in any language where the same syllable is repeated 
more than twice in succession. In Chinese, which is a mono- 
syllabic language, repetition of the same word three times is 
so infrequent as to be noteworthy. An exception, kan kan kan 
(Mandarin for look and see) is uttered in almost a monotone. 
The Shanghai form, ku ku ku, exhibits a greater variation in 
both stress and pitch. 

Related to the above is the difficulty encountered when we 
attempt to teach rats or other animals to solve the double- 
alternation problem mentioned in the preceding section. 
Some experiments make use of the temporal maze (see Figure 
48). In this maze, the subject may be required to make a 
left-left-right-right sequence of turns when confronted with 
the same choice point on four successive occasions during his 
running. Mastery of the problem may be impossible for some 
animals because of the similarity of the repeated elements 
(the S D 's as well as the responses) in the chain. The situation 
resembles that in which a verbal chain is composed of 
formally similar responses each of which produces the S D for 
the next. ' 'Tongue-twisters" are constructed on this basis. 
Pluto, the tutored poodle, practised triple pedal-pressing. 
The difficulty of emitting such a verbal sequence rapidly may 
partially explain the infrequency of polysyllabic English 
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words in which the same syllable precedes two different syl- 
lables. Witness the trouble we have with such a common 
word as statistics. We can sympathize with the child who 
struggled to say to his father: You put your toothbrush in my 
cup and you put my toothbrush in your cup! 
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FIG. 48. Floor plan of a temporal maze. E is the point of entrance 
from which the animal moves down the central pathway to the choice 
point, where he may turn right or left. The hinged doors (i, *, 3, 4) are 
manipulated by the experimenter, who prescribes the path by which the 
animal can return to E. Reinforcement, such as food, may be provided 
at E after each run, or only after a series of runs, such as left-left- 
right-right. (After Hunter, 1928.) 

It has been suggested by Skinner (1938) and Mowrer and 
Jones (1945) that the procedure of periodic reinforcement is 
conducive to the formation of new units, as when a number 
of bar-pressings are followed by a pellet of food. Mowrer and 
Jones carried out an experiment in which resistance to extinc- 
tion of this response was related to schedule of reinforcement. 
Five groups of rats were used. One group was regularly rein- 
forced during training; one group was aperiodically rein- 
forced; and three groups were reinforced at fixed ratios of 
1:1, 2:1, and 3:1 respectively. Their findings led them to 
conclude that new "patterns" or units of response could be 
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set up; and that the number of functional responses during 
extinction was nearly the same for the different groups of 
animals, in spite of an increased number of pressings as the 
ratios became larger. How far we can go with this type of 
reasoning is questionable. It seems unlikely that the rat 
would be able to make more than two or three bar-pressing 
responses into a single unit through chaining, but further 
experimentation ought soon to give us an answer. Close 
observation should readily disclose whether two or more 
pressings come to act like one, either during fixed-ratio train- 
ing or during a subsequent extinction period. 

The degree of similarity among the successive links of a 
chain would seem to set a limit to the size of a unit that can 
be developed. The number of different responses available 
to an organism is by no means infinite. Sooner or later, as 
attempts are made to add links to a chain, generalization and 
induction will take their toll. More and more discrimination 
and differentiation will be required, until a saturation point 
is reached and new links are added only at the expense of 
the old. 

Chaining and Delayed Reinforcement 

Facts sometimes lose their simplicity when analyzed but, at 
the same time, get to be more understandable. We now deal 
with such a case. At several points in this text, it has been 
stated, or implied, that an operant is conditioned more rap- 
idly when reinforcement follows immediately upon emission 
of the response. This is supported by a number of investiga- 
tions and is a truism for animal trainers and educators in 
various fields. The fact that conditioning is slower as the 
reinforcement is more delayed has led some theorists, like 
Hull (1943) to speak of a 'reinforcement gradient' and to 
specify the limits of delay beyond which reinforcement is no 
longer effective in strengthening a response. 

Delay of reinforcement can now be seen to involve the 
formation of reflex chains. When we say that a bar-pressing 



CHAINING 2O9 

response, for example, has been reinforced after a five-, ten-, 
or fifteen-second delay, what we really mean is that we have 
immediately reinforced some other response which occurred 
five, ten, or fifteen seconds after the bar pressing. During the 
time of 'delay,' the animal does not stop behaving; he merely 
idoes something else over which we may have little control. 
We set up a chain of reflexes, of which bar-pressing is an early 
link. The bar-pressing link cannot be strengthened until we 
have strengthened the later ones, and this takes time. Even in 
what we treat as a case of immediately reinforced bar-press- 
ing, there is at least one later link in the chain the response 
to the food-tray, which follows the actual pressing and is 
conditioned first. The immediate reinforcement for the press- 
ing is of a 'secondary' sort (to be described in the next chap- 
ter), but it is nonetheless immediate (Spence, 1947; Grice, 
1948). 

The presence of the last (and first-reinforced) link in such 
a chain is sometimes unsuspected and may go unobserved. 
For example, when one conditions a white rat to turn off a 
light, in the manner described on page 62, the situation is 
presumably one of 'immediate* reinforcement: the light goes 
off within a fraction of a second after the bar-pressing re- 
sponse is made. It may be shown, however, that the reinforc- 
ing effect of light-removal comes later, when the animal is 
doing something else holding down the bar, turning away 
from it, rising above it, and so on. Probably because of the 
visual lag involved, the effect of light-removal is not felt at 
the instant of bar-depression. We do not reinforce first the 
response that we are measuring; rather, we make way for a 
'superstitious' response to develop, and we are compelled to 
establish a longer chain than we intended. 

A comparable situation exists in certain sports and games, 
where the first-conditioned response actually follows the effec- 
tive one. In bowling and in billiards, the last link in the chain 
is the one often referred to as "body English" the posture or 
the movements which are the true accompaniments of a sue- 
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cessful throw of the ball or shot with the cue. Similar em- 
bellishments may be observed in the behavior of amateur 
baseball pitchers, basketball players, and golfers. 
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FIG. 49. The effect of delaying the primary reinforcement upon the 
rate of mastering a discrimination. Six groups of hungry animals were 
trained to make a white-black discrimination in a discrimination box 
where the "correct" choice (response to S D ) led to food. The response 
to S D , however, was reinforced after different time intervals, the delays 
varying from o to 10 seconds. The curves show, for successive blocks of 
20 trials, the per cent of correct choices made by the animals in each 
group. Note that the longer the delay the slower the learning; until, 
with a lo-second delay, the animals do not get past a chance (50 per 
cent) performance. This experiment was carried out under conditions 
especially designed to eliminate any 'secondary reinforcement* inter- 
vening between the choice-point response and the ultimate primary 
reinforcement. This elimination reduced the size of the maximum delay 
previously thought possible, and was the effect predicted by the experi- 
menter who believed that long delays involved the formation of a 
response chain composed of secondarily reinforced units. (After Grice, 
1948.) 



Exteroceptive and Proprioceptive S D 's in Chaining 

The preceding sections may have suggested to you another 
distinction that is sometimes made between reflex chains. In 
most cases, the obvious discriminative stimuli for the succes- 
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sive responses of a chain are in the external environment 
they belong within the exteroceptive class. One response puts 
the organism in a position such that a new outside stimulus 
is presented. That is, one response produces the external 
stimulus for the next, as in the behavior of bar-pressing and 
maze-running. Yet you may have surmised that chains are 
sometimes composed of responses the principal stimuli for 
which seem to be other responses. This seems to be especially 
true of verbal behavior. When we repeat a well-worn phrase, 
it often seems that one word leads to another directly, rather 
than by way of the exteroceptive S D 's (sounds) produced by 
our speech. We emit the accustomed sequence in a more or 
less automatic fashion, and we would be hard put to show 
that each component word depended upon some outside 
stimulus which the preceding word had produced. We are 
more likely to say that one response led to another without 
any reference to changes in the external environment. 

We have assumed all along that responses are responses-to- 
stimuli. Now, when we talk about a response that leads to 
another, we imply that a response can be a stimulus for an- 
other. But how can we speak of a response as a stimulus 
without dealing in the worst sort of confusion? Fortunately, 
there is a well-accepted answer to this question. Early in the 
nineteenth century it was discovered that we possess a muscle 
sense that there are actual receptor organs within our 
muscles, tendons, and joints which are excited by the move- 
ments of these effectors. This came to be known later as the 
kinesthetic (movement-perceiving) sense and to it was 
ascribed our subjective awareness of the position or changes 
in position of the movable parts of our bodies. As a more 
objective psychology developed, the term proprioceptors was 
adopted as a name for these sense-organs and proprioceptive 
was applied to the stimuli that excited them (see pages 4-5). 

Evidence for the importance of this type of movement- 
produced stimuli accumulated throughout the years. Today 
we have every reason to believe that responses may produce 



212 PRINCIPLES OF PSYCHOLOGY 

directly the discriminative stimuli for further responses, even 
in the absence of exteroceptive changes. Thus, we are willing 
to ascribe to the influence of proprioceptive S D 's the fact that 
eye-movements may provide cues for our judgments that 
objects are at different distances; that arm, wrist, and hand 
movements give the cue for judging one lifted weight to be 
heavier than another; that widespread muscular tensions may 
set off the responses that help us to maintain posture and 
right us when we are off balance; and so on. It is difficult to 
isolate and observe the action of such stimuli, because of the 
locus of the receptors involved and because they commonly 
accompany the stimulation of other sense organs, but we can 
feel relatively safe in asserting that whenever one response 
leads to another, and no exteroceptive cue can be discovered, 
a proprioceptive cue can be inferred. 

Probably the best examples of chaining in which proprio- 
ceptive stimuli may predominate are to be found in the 
sphere of verbal behavior. Verbal chains, once established, 
may occur in the absence of exteroceptive stimuli. Spoken 
words, of course, produce sounds which become exteroceptive 
cues for the speaker himself and thus play an important part 
in the formation of chains; but when these chains are once 
set up they may be emitted sub-vocally or sub-audibly, in 
which case the principal S D 's are proprioceptive. When we 
silently rehearse a speech or utter a prayer, each response 
seems to depend upon the preceding one for its emission, 
which is another way of saying that each response is to the 
proprioceptive stimulus aroused by the preceding response. 

Covert Response 

To most persons there is nothing objectionable in the view 
that overt or observable behavior may be caused by factors 
that are hidden from the observation of others. They will tell 
you without hesitation that many of their actions are inter- 
nally instigated. They will often report a train of "associated 
ideas" which led to the performance of some overt action. 
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But they will not as readily identify these "ideas*' with mus- 
cular responses to proprioceptive stimuli. Common sense 
tells them that there are ideas and actions, either of which 
may lead to the other; common sense does not tell them that 
associations of ideas are equivalent to chains of reflexes which 
differ from observable ones only in the magnitude of the 
responses involved. 

Yet, throughout the ages, the notion has often been ad- 
vanced that thought and action are not always distinct that 
some thinking, at least, was no more than inner speech. Plato, 
among the ancients, held such an opinion, and so did others 
who came later. In the nineteenth century, Alexander Bain, 
one of the founders of modern psychology, treated thinking 
as "restrained speaking or acting." Later still, John Watson 
described thinking exclusively in terms of covert response, 
either sub-vocal talking or "sub-gestural" gesturing. He 
argued that children, at first, think out loud and come to 
speak silently only through parental admonition; that adults 
who are withdrawn from social contacts tend to think, as well 
as read, aloud; and that deaf-mutes think and dream with 
their fingers. He suggested that sensitive measuring instru- 
ments would ultimately disclose tiny muscular movements 
in every instance where thinking occurred. 

Support for such beliefs has come, in recent years, from 
several experimental studies in which minute electrical 
effects or "action currents" of muscular contractions have 
been magnified and recorded. Jacobson (1932) gave human 
subjects extensive training in relaxation (to reduce the gen- 
eral level of electrical discharges from the muscle groups to 
be tested) and then, with electrodes placed in the region of 
certain muscles, asked each subject, while in the relaxed state, 
to engage in imaginal and thinking activities. In one experi- 
ment, when the electrodes were fastened near the flexor 
muscles of the right arm, the subject was told to imagine rais- 
ing a cigaret to the mouth. When such imagining was carried 
out, there was a corresponding electrical effect an effect that 
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was not registered in the right-arm electrodes when a move- 
ment of the left arm was imagined. Similarly, and more strik- 
ingly, action currents were recorded from eye-moving muscles 
when the subject was asked to visualize such an object as the 
Statue of Liberty or the Eiffel Tower. Finally, with electrodes 
attached to tongue- and lip-moving muscles, currents ap- 
peared when the subjects engaged in mental multiplication 
or when they recalled, in thought, the words of a song or the 
meaning of such a term as incongruous or eternity. These and 
related observations led Jacobson to conclude that covert 
muscular response was an essential condition of thought and 
imagination. 

The experiments of Max (1935, 1937) give dramatic sup- 
port to Jacobson's conclusion. Max used as his principal 
subjects nineteen deaf-mutes who were adept in sign-lan- 
guage. By attaching electrodes to both forearms of a subject 
simultaneously, he was able to get action-current records of 
finger and hand movements under conditions in which no 
overt response could be detected. Records were taken while 
the subject was in the waking state preparatory to sleep, and 
while he was actually asleep. With respect to the sleep rec- 
ords, Max discovered that when strong action-currents 
appeared and his subjects were awakened immediately there- 
after, they almost invariably reported having been aroused 
from a dreamwhich they often described in great detail. 
When, however, they were awakened in the absence of these 
strong discharges, they almost never reported that they had 
been dreaming. (Contrary to the popular belief that fairly 
lengthy dreams may take place in only a few seconds of time, 
Max found that "dream reactions" in his experiment usually 
involved two and one half minutes or more of responding.) 
When mild tactual stimuli were applied to the subjects dur- 
ing sleep, covert muscular responses followed in about 65 per 
cent of the cases. In some instances, these stimuli were suf- 
ficient to initiate "dream reactions/' Moreover, when deaf 
subjects, in the waking state, were given various "abstract 
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FIG. 50. Action currents from arm muscles during real and imaginal 
weight lifting. The upper curve is for the amount of electrical discharge 
when the weights indicated on the abscissa were actually lifted. The two 
lower curves are for imaginal lifting of these weights, one for times 
when "clear" images were reported, and one for "fair." (After Shaw, 
1940.) 
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thinking" problems to solve, they showed a far greater degree 
of covert finger and arm movement than did normal subjects 
under the same circumstances. 

Additional support for a response interpretation of mental 
activity comes from a study by Shaw (1940). This investigator 
trained a number of normal adults to report upon the rela- 
tive heaviness of several small weights which they lifted one 
at a time in random order during several experimental ses- 
sions. These reports presumably depended upon propriocep- 
tive stimuli aroused by the lifting movements themselves 
that is, the lifting response provided the S D for the weight 
judgment. But this is not the main point. When his subjects 
were well trained in lifting the different weights, Shaw used 
the electrical recording technique and found consistent 
differences in the amount of action-current produced when 
the subjects were asked to imagine lifting each of the weights. 
The amount of electrical discharge varied together with the 
amount of weight that the subjects imagined they were 
lifting. 

These results are in accord with those of Ewert (1933) who 
measured the eye-movements of college students during the 
reading and recall (oral or silent) of stanzas from John Gil- 
pin's Ride. Ewert was led to his study by observing, in one of 
his students during an examination, eye-movements which 
seemed to resemble those made in reading a textbook assign- 
ment! His experimental results showed that, in oral or silent 
recall of previously read material, the frequency of move- 
ments was approximately equal to that observed in the orig- 
inal reading. While this study was not concerned with truly 
covert responses, since the eye-movements (ordinarily un- 
noticed) could be subjected to direct observation, it does 
point to the importance of response in what is commonly 
thought of as an Imaginal* sort of activity. 
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Covert Response in Code Learning 

We have, then, considerable evidence pointing to the exist- 
ence, in human beings, of stimulus-response relationships 
which are ordinarily hidden from objective observation. In 
view of this evidence, we do not hesitate to affirm the exist- 
ence of chains of reflexes in which at least some of the ele- 
ments are covert. Even if we refused to trust introspective 
reports of such chains, we would probably feel justified in 
assuming their reality. 

The assumption of covert chaining serves to clarify and 
bring within the same conceptual framework a number of 
apparently unrelated observations. Let us consider, in the 
present section, the process of learning to receive Morse 
Code. If, under the code-voice method of training described 
in Chapter 5 (page 147), one observes carefully the behavior 
of a beginning code-student, two things soon become obvious. 
(i) The time required for responding to a signal by printing 
a character (letter or digit) is gradually reduced as practice 
continues. Early in training, the student makes use of all the 
time available between the presentation of the signal and 
the announcement, by the instructor, of its name. Later on, 
the appropriate response comes more and more quickly, well 
in advance of the identification. The latency of the response 
becomes shorter and shorter. (2) Along with the decrease in 
latency goes another change. A student may at first exhibit 
a marked degree of apparently superfluous activity during the 
latent period. After the signal sounds, and before he prints 
his character, various interesting reactions may occur. He 
may tap with his pencil or his foot, make nodding movements 
with his head, whistle softly to himself, or murmur some 
equivalent of the signal, such as "di-dah" or "di-di-dit." Only 
after such activity does he make the printing-response. Grad- 
ually, however, there is an abbreviation of this activity, until 
overt intervening responses may no longer be observed. Yet, 
even at this point, the student may report that he thinks 
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about the signal before he makes his final reaction to it. Thus, 
he may echo it, visualize it, or respond in some other covert 
fashion. Ultimately, this covert activity also disappears, and 
he may report that he prints or "copies" his characters auto- 
matically, without any thought of what he is doing. 

A tentative analysis of such changes would run as follows. 
In the first stage of code-receiving, the signal occasions vari- 
ous responses, sometimes overt, which serve in turn as stim- 
uli for the copying response that ends the sequence. Later, 
these intervening responses become covert, although still 
present as members of the chain. Finally, they are eliminated 
entirely, and the observed decrease in latency is thereby made 
possible. The latency decrease may probably be explained in 
terms of two principal factors: the failure of long-latency 
responses to receive reinforcement, and the accumulation of 
negatively reinforcing effects produced by the continued repe- 
tition of (unnecessary) intervening activity. 

Covert Response and Reaction Time 

In our treatment of reaction times in Chapter 5, we have 
noted the classic distinction, by Bonders, of the A-, B-, and 
C-reactions. We might also have considered another type, 
called by Wundt the D-reaction, which was said to be dis- 
tinguished by the fact that it involved a recognition or identi- 
fication of the stimulus prior to the overt response. What was 
essentially this D-reaction came later to be called the sen- 
sorial reaction and was contrasted with the muscular reaction 
(the successor of Bonder's A-reaction). In the sensorial reac- 
tion procedure, a stimulus, say a sound, is presented and the 
subject responds only after it has been apprehended; whereas, 
a muscular reaction requires the subject to pay attention to 
the response only. The sensorial reaction time was found by 
Lange (1888) to be about one hundred milliseconds longer 
than the muscular reaction time. 

If you have followed our discussion and are ready to con- 
sider the recognition, identification, or apprehension of a 
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stimulus as basically the same sort of phenomenon as think- 
ing about a stimulus before responding to it overtly, you can 
see why we deferred the treatment of the D-reaction until 
this time. The reason for the lengthened latency of the sen- 
sorial reaction becomes the same as that which holds back the 
progress of the beginner in copying Morse Code. An inter- 
vening covert responsean extra link in the chain is involved 
in both cases. 

An additional factor adds weight to this analysis. In the 
history of reaction-time study, the distinction between sen- 
sorial and muscular reaction time proved very difficult to 
maintain under experimental conditions. When subjects 
were repeatedly asked to make the sensorial reaction, their 
times tended to approach closely the values obtained for the 
muscular reaction. Practice seemed to decrease the difference 
between the two. Just as the code student conies to respond 
automatically to the presented signal with a minimal latency, 
so does the laboratory subject find it all too easy to eliminate 
the cognitive activity which characterizes the sensorial re- 
action. Our explanation of the two changes would be much 
the same. It is unrewarding work to maintain the covert 
member or members of the chain. 

The Context Theory of Meaning 

Our mention, in the preceding section, of such matters as 
cognition and apprehension, leads quite naturally into the 
discussion of another problem, that of meaning. Common 
-sense tells us that objects, ideas, and actions ordinarily mean 
something; and the psychologist is not infrequently asked to 
give his explanation, to say why. 

One answer to this question is that offered by Titchener 
(1915), in the early years of the present century, and it may 
be of more than passing interest to consider his formulation 
and the bearing of our own position upon it. Titchener was 
an introspectionist in psychology, who believed that a man's 
conscious processes were basically analyzable into elements, 
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such as sensations and images; that these elements were usu- 
ally compounded within perceptions and ideas; and that an 
obvious characteristic of the latter was the meaning they pos- 
sessed. (It was, of course, assumed that sensations, images, per- 
ceptions, and ideas were subject to the introspector's personal 
observation.) 

"Meaning," says Titchener, ". . . is always context, one mental process 
is the meaning of another mental process if it is that other's context/' 
Context itself is nothing more than "the fringe of related processes 
that gathers about the central group (or "core") of sensations or images/' 
In perception and in idea there is core plus context, and the latter 
"carries" the meaning of the former. 

Titchener offers a number of illustrations to show the wisdom of 
this core-context distinction. Context may, in some cases, be stripped 
from the core as when we repeat aloud some word until the context 
disappears and the word becomes meaningless; context may be added 
to coreas when we learn the meaning of some strange design or 
foreign word; context and core may be disjoined in time as when we 
know what we want to say but need time to find expressive words, or 
when the point of a joke is delayed in appearance; the same core may 
have several contexts as implied in our worry about the true meaning 
of a chance remark . . .; and so on. (Keller, 1937) 

Titchener had still more to say about the matter. Meaning > 
as he saw it, is originally derived from an organism's move- 
ment: context is initially kinesthetic or muscular sensation, 
such as that aroused by our bodily orientation toward a stim- 
ulus. Later, the context may be visual, auditory, and so forth. 
Ultimately, however, the meaning becomes almost entirely 
verbal when the context is added by something we say (to 
ourselves) about a stimulus. Or it may even reduce to a 
"brain habit/' without any conscious context; we respond 
automatically and appropriately, but without any fringe of 
conscious process which supplements the core. 

However strange this theory may sound to you at first, a 
little thought will make it less so. What Titchener calls con- 
text is no more than a chain of covert responses which inter- 
venes between the initial, observable stimulus and the final, 
overt reaction. Recall, for a moment, our code-learning ex- 
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ample. A code signal has hardly any meaning for the student 
at the start; it evokes no more than a kind of "what is it?" 
response. Later, its meaning grows as the intervening re- 
sponses (the "di-dahs," the visualizing, etc.) come in. Finally, 
as these responses drop out, the 'meaning' of the code de- 
creases. Clearly, intervening covert response is the counter- 
part of Titchener's "conscious context." 

You can go on with the translation, point by point it is not 
difficult if you have understood the main theme. The trans- 
lation will not provide you with a complete theory of mean- 
ing but it is encouraging to find that an objective science of 
behavior is able so easily to make use of the findings of pre- 
behavioral introspective psychology. 

The Association Experiment 

Related to the principle of chaining, as well as to the re- 
action-time study, is the well-known association experiment 
a technique which has been used extensively in psychologi- 
cal laboratories and clinics since 1879, when Sir Francis Gal- 
ton introduced the basic procedure. Galton presented himself 
visually with 75 stimulus words, one at a time, from a pre- 
viously prepared list, and measured with a chronograph the 
time elapsing between his first view of each word and the 
arousal of the first two ideas which were suggested by it. After 
each stimulus-response sequence of this sort, he undertook to 
examine the nature of the ideas (whether they were visuali- 
zations, verbalizations, or some other acts) and their origin 
in his own life history (whether from boyhood and youth, 
from subsequent manhood, or from more recent experiences). 
By repeating his list four times, at one-month intervals, he 
was led to the following conclusions among others: (i) 45 
per cent of his responses were purely verbal, 32.5 per cent 
were in "sense imagery" (e.g., visualizations), and 22.5 per 
cent were "histrionic" (postural); (2) the most frequently re- 
curring ideas in the four tests dated from his boyhood; (3) 
the average time estimated for the recall of a single idea was 
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1.3 seconds; and (4) some of the ideas recalled were unfit for 
publication, since they laid bare his "mental anatomy"! 

In the modern form of the association experiment, words 
are still employed as stimuli, but they are always presented by 
another person, and the time measured is usually that which 
elapses between the stimulus word and the subject's overt 
response with the "first word that comes to mind." As a rule, 
the stimuli are drawn from standard lists and presented to 
individual members of various groups of subjects previously 
selected on the basis of their age, sex, occupation, educational 
level, and so forth. In some cases, the experimenter's prin- 
cipal interest lies in the nature of the response words emitted; 
in other cases, the reaction times also receive attention. 

One of the better-known studies emphasizing the kind, 
rather than the speed, of response is that made by Kent and 
Rosanoff (1910), in which one hundred familiar nouns and 
adjectives were presented vocally to 1,000 subjects, mostly 
adults, who were selected, more or less at random, from the 
general population. When all the response words were tabu- 
lated for these subjects, a frequency table was constructed to 
show, for each stimulus word, the number of subjects who 
responded in identical fashion. Thus, light was given in re- 
sponse to dark by 427 of the 1,000 subjects, night was given 
by 221, black by 76, and so on. As this example indicates, 
considerable agreement in responding was shown. The in- 
dividual stimulus words, however, varied greatly in the num- 
ber of different responses they occasioned. Anger, at one 
extreme, led to 276 kinds of verbal response, whereas needle 
at the opposite extreme, led to only 72. Anger, we might 
say, has many more meanings than needle. Moreover, individ- 
ual subjects differed markedly in the frequency with which 
they tended to make unique responses. Some tended to "fol- 
low the crowd" while others leaned in the direction of the 
bizarre and eccentric, indicating very different histories of 
reinforcement. 

Many studies of associative reactions have been made with 
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the Kent-Rosanoff and other word lists, using a great variety 
of subjects, a number of different experimental situations, 
and several systems of classifying the responses. These studies 
cannot be reviewed here, but a few of the findings may be 
mentioned as worthy of interest. For example, children, in 
contrast with adults, respond with (i) more completions of, 
or enlargements upon the stimulus words (dark night, red 
wagon, soft pillow); (2) more defining responses, often 
involving several words (table made of wood); (3) fewer op- 
posites (dark light, soft hard); and (4) more sentence re- 
sponses (soft snow is soft). Very young children may often 
show a tendency, not entirely absent in adults, to repeat the 
stimulus word (lamp lamp). 

A thoroughgoing analysis of the association experiment is 
yet to be made. The basic principle involved is quite clear, 
however, and has often been recognized, openly or by impli- 
cation. It may be stated simply. In the history of an organism, 
one verbal response, through chaining, becomes the S D for 
another. The stimulus word is part of a chain which has the 
response word as another member. A single stimulus word 
may, however, belong within more than one chain, and the 
likelihood of a given response to such a word will then de- 
pend upon a number of factors, operating individually or 
together. These include: (i) the amount of reinforcement 
which a specific chain has been accorded; (2) the amount of 
collateral strengthening due to the presence of certain S D 's 
in the experimental situation; (3) the presence of strong mo- 
tives or emotions; (4) the recency with which the response has 
been reinforced; and (5) the amount of energy expenditure 
involved. 

An example may illustrate the possible operation of these 
factors. The stimulus word is dark, (i) Adults have long used 
this word in juxtaposition with light; children have more 
frequently followed it with night; photographers have often 
combined it with room; and so on. Different responses have 
different reinforcement histories. (2) Any one of the above 
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responses may be given additional strength by some aspect of 
the immediate stimulus situation. Light may be helped by 
some conspicuous lighting effect in the experimental cham- 
ber; night may be partially initiated by the gathering dark- 
ness observed outside the laboratory window; and room may 
be aided by the dimness of the chamber (This room is dark) 
or even by some confining aspect of the situation (Not much 
room here). (3) For a hungry subject, growing eager for his 
meal, dark may occasion meat; and for a subject in whom 
dark has become a conditioned stimulus for a violent fear 
reaction, all responses may momentarily be inhibited. (4) For 
a subject who came to the experimental session directly from 
a political discussion or talk of a coming election, dark may 
well evoke horse. (5) Finally, of several possibilities, some re- 
sponses may be less effortful than others. Consider the fol- 
lowing hypothetical responses to dark: 

(i) the forest primeval (2) complexion (3) light (4) bark 

Classically these responses might be said to fall within a scale 
of 'meaningfulness-superficiality,' with the most meaningful 
at the top of the list and the most superficial at the bottom. 
If 'superficial' is taken as the equivalent of 'economical' (as it 
sometimes is), and if we see the reactions as running from 
least to most energy-conserving, they also fit well into an 
effortful-effortless scale. It is then quite understandable that 
increased superficiality of response has been claimed by re- 
searchers to accompany states of fatigue and repetition of 
stimulus lists. 

Our list of determining factors may not be exhaustive, but 
it should suggest to you that the emission of a verbal re- 
sponse, like any other, may depend upon a variety of circum- 
stances. More often than not our responses depend upon the 
joint operation of several factors, rather than the isolated 
action of one. Clever experimental design and close weighing 
of probabilities may be required to illuminate the influence 
of a single variable upon the behavior of an organism with 
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uich a complicated past as that of the average adult human 
being. 

When we turn to the matter of reaction times in the as- 
sociation experiment, we are immediately confronted with a 
large body of factual information most of which adds little to 
the present discussion. Perhaps the two outstanding facts, 
noted by many investigators, are (i) that the reaction-time 
values in this type of experiment are usually much greater 
:han in the case of word-reactions to non-word stimuli; and 
[2) that other-word reactions are considerably slower than 
tame-word (repeating) reactions. Such findings are in accord 
with the fact that covert response words are commonly re- 
Dorted to intervene between the verbal stimulus and the 
[overt) verbal response. Moreover, since any stimulus word 
nay have been linked with several words in previously es- 
.ablished chains, we would expect occasional interference of 
responses an expectation which is often supported by sub- 
ects who claim that response words "get in each other's way." 
There is also the fact that some stimuli, by virtue of earlier, 
-espondent conditioning, may serve to depress for a short 
period all operant behavior, including the verbal; and this 
vould obviously lengthen the reaction times of some subjects 
.o some of the words of a stimulus list. 

'Complexes' and 'Guilt' 

It is partially due to the last-mentioned fact that the associ- 
ition experiment has been used as a complex-indicator and 
is a guilt-detector. Carl Jung (1919) , the famous Swiss psy- 
:hotherapist, was the first to see that 'complexes' of repressed 
Behavior might be brought into the open by probing with 
timulus-words; and he made an exhaustive analysis of the 
issociative reactions which pointed to the existence of these 
:omplexes. His work amounted to a recognition of the fact 
hat certain, ordinarily neutral, words might have become 
emotionally 'charged* for a patient by virtue of their con- 
lection with some earlier experience, generally involving 
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negative reinforcement, and that the responses normally 
found linked with such words were thereby less likely to oc- 
cur. When presented with the stimulus words of a 'free-as- 
sociation* test, the patient might respond to some of them (i) 
with very unusual, far-fetched, or highly personal words; (2) 
by repeating the stimulus word before making an other-word 
response; (3) by 'misunderstanding* the stimulus word; (4) 
by giving exceptionally long reaction times or no word-re- 
sponse at all; and (5) by obvious signs of emotional upset, 
such as blushing, stammering, whispering the response word, 
and so on. Jung found that testing and re-testing with care- 
fully chosen stimulus words often aided him greatly in get- 
ting at the roots of his patients' troubles, thus paving the road 
to therapy; and his lead has been followed by many clinicians 
since. 

The clinical use of the association experiment is like its 
use as a 'guilt-detector/ but with a difference. "The detec- 
tive knows the crime, but not the culprit; the psychotherapist 
knows the culprit but not the crime." (Woodworth, 1938). 
We may add that the 'crime' in question is likely to be 
crime by courtesy only, since it is more often an affair of the 
classroom than of the courts. In the simplest demonstration 
of guilt detection, two students may be drawn from a class 
and given secret instructions as to their conduct during, say, 
a ten-minute period outside the room. One student (the 'cul- 
prit') is sent into a course of action which is likely to produce 
embarrassment. For example, he may be asked to enter a 
professor's office, during his absence, and engage in some triv- 
ial activity such as arranging papers on a desk. In the course 
of this activity a 'mishap* of some sort is arranged to occur 
and the 'unsympathetic' professor unexpectedly returns. The 
other student is subjected to no such experience, but, at the 
end of the ten-minute period, both men are brought back to 
the classroom and given the free-association test individually. 
The class is acquainted with the nature of the 'crime,' but 
does not know the culprit; and the latter has been instructed 
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to hide his guilt if possible. The same set of ordinarily neu- 
tral stimulus words is presented to each student, but several 
of these words have been chosen because of their possible 
significance to the 'criminal' through his recent experience. 
Response words are recorded and reaction times are taken 
with a stop-watch. In this 'experiment* the class, as a rule, has 
no great difficulty in detecting the culprit, either by his un- 
usual responses to such common words as ink, desk, office, 
Professor, and the like, or by unusually long reaction times. 
Unfortunately, the validity of this type of detection is not 
great enough to justify its exclusive use in the detection of 
real criminals. When carefully administered and accompa- 
nied by other data (e.g., changes in blood pressure or the gal- 
vanic skin response) it has sometimes been successful in 
evoking confessions, but it has serious limitations, since it is 
not sure-fire and may even point the finger of guilt at per- 
sons who know about the crime but did not commit it. 

Mazes and Nonsense Syllables 

A discussion of chaining leads us naturally into two other 
well-known spheres of psychological research. One of these 
has been chiefly concerned with the ability of animals (white 
rats in particular) to run mazes; the other has dealt with the 
rote memorizing of verbal materials (for example, nonsense 
syllables) by human beings. Both have often been cited as 
cases of "serial learning," which gives you a clue as to the 
reason for their inclusion in the present chapter. 

Hundreds of maze-learning experiments have been carried 
out since the turn of the century, when the technique was 
first introduced, with results that often impress observers as 
small, considering the amount of labor expended. What was 
considered, at the outset, to be a very simple form of learn- 
ing, and the possible prototype of habit-formation in human 
beings, is in reality very complex. Moreover, we now see the 
problem merely as one calling for the application of explana- 
tory principles. 
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We are now able (as we were not in Chapter 3) to enu- 
merate some of the more important factors involved in the 
mastery of any maze habit. We can say, with assurance, that 
the solution involves the establishment of a chain of reflexes, 
initiated when an animal is placed in the maze and termi- 
nated when he arrives at his reward. From our analysis of 
bar-pressing behavior we can assert that successive discrim- 
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FIG. 51. Curve relating the length of nonsense-syllable lists to the 
time taken to memorize the lists. The subject was given one trial per 
day until a given list was mastered. The values on the abscissa and 
ordinate are the actual lengths-of-list and corresponding memorizing- 
times. Note the positive acceleration of the curve arising from the ever 
increasing generalization as the lists (chains) grow longer. (After 
Lyon, 1917.) 

{nations and differentiations are involved at successive points 
in the maze pathway; and we can see, as basic to these, the 
principles of operant conditioning and extinction. Also, and 
of extreme importance, we can assume the operation of sec- 
ondary reinforcement, a factor with which we shall deal in 
the coming chapter. 

With respect to rote-learning studies, the situation is much 
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the same. Since 1885, when Ebbinghaus reported his famous 
experiments, thousands of students have memorized thou- 
sands of lists of words or nonsense syllables under various con- 
ditions and to various degrees of mastery. Rate of learning, 
as measured by number of list repetitions or time required to 
reach a given standard of performance, has been related to 
length of list, meaningfulness of list material, spacing of 
practice periods, and many other variables. Similarly, the re- 
tention of such memorized lists has been studied as a function 
of time-since-mastery (and the nature of the activity that 
filled this time), rate of initial learning, amount of over-prac- 
tice, and so on. 

The problem of rote memorization is similar to that of 
maze learning. It has practical significance because it is like 
some of the tasks that face us in everyday affairs. Like maze 
learning, it presents complex matters for analysis (Hull, et al., 
1940) and involves the operation of more fundamental mecha- 
nisms, but we are not yet at a stage of development where we 
can do more than suggest the manner of their interaction. 
The over-all picture is one of verbal chaining, in which the 
emission of each response comes to depend upon the S D func- 
tion of the response, or responses, preceding it in the series. 

NOTES 

Some exploratory observations by Charles Ferster, in the Columbia 
laboratory, bear upon the problem of 'homogeneous' chaining and the 
formation of new response units. Ferster was able to train several rats, 
with some success, to make paired responses for food reinforcement. 
He observed, among other things, that, when pairing was well estab- 
lished, the two members of the pair were seldom if ever of the same 
form. The first response might be with one forepaw and the second 
with the other; the first might be a fairly protracted push of the bar 
and the second might be a sharp slap; and so on. Heterogeneity of the 
two links, rather than homogeneity, was the rule. This gives support 
to our statements in this chapter concerning the improbability of purely 
homogeneous chaining. 

More than thirty years ago, Swindle (1917, 1919) published several 
reports on the behavior of animals (e.g., the pecking behavior of cocka- 
toos) in the Berlin zoo. Such responses, in spite of a high degree of 
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stereotypy, showed some variation in force, even when emitted at a 
very constant rate. Swindle also pointed out that what we call a 
reaction time is dependent upon the number of responses that intervene 
between the stimulus and the response that we have selected for ob- 
servation. We are reminded not only of the associative reaction, but 
also of such 'latencies' as are measured in the run-way and the bar- 
pressing experiments. 

An excellent account of the association experiment is given by Wood- 
worth in his Experimental psychology (1938). You should now be able 
to appreciate some of the factors involved in this kind of experimenta- 
tion and find, in Woodworth's examples, plenty of food for thought 
and some suggestions for future research. 

The "dream reactions" reported by Max (page 214) to result from 
mild stimulation of his sleeping subjects, makes more credible many 
non-experimental reports of induced dreaming. Thus, it has been said 
that exposure of a sleeper's feet to cold may evoke dreams of, say, 
Arctic exploration; that the repeated tipping of a sleeper's bed may 
lead to dreams of ocean voyages; that the entanglement of legs or arms 
in bed-clothes arouses dreams of frustrated action; and so on. 

Early investigations (e.g., Thorsen, 1925) of the relation between 
covert response and 'inner speech* or 'thinking' apparently suffered 
from inadequate experimental technique. Modern methods of record- 
ing the electrical changes that arise from covert muscular movement 
(as in the Jacobson and Max studies) open up a large field of research. 
Studies by Davis (1948) and others suggest that a frontal attack is 
possible upon such 'intervening' responses as those involved in asso- 
ciative reactions, meaning, and the like. 

One of the first and most elaborate treatments of covert response 
and its psychological significance is that of Kantor (1924) who found 
in such response the basis of many allegedly 'mental' activities perceiv- 
ing, imagining, thinking, feeling, remembering, and so forth. Kantor's 
account, although not obviously guided by experimental findings, has 
many points of contact with the one of this chapter. The advanced 
student will not let the difficulty of this book deter him from making 
the effort required to appreciate its contents. 
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THE CAWING of a rook ... in itself, is certainly not agreeable. This 
sound, in the case of those who have lived in the country in early life, 
and enjoyed its scenes and its adventures, is well known to become a 

particularly agreeable one The explanation is that this particular 

sound, having been heard again and again among surroundings . . . 
which have a marked accompaniment of pleasure . . . produces a faint 
re-excitation of the many currents of enjoyment which accompanied 
these. 

James Sully, The Human Mind, 1892 

A Pause for Review 

The last chapter's discussion of response chaining has no 
doubt sharpened in your mind a question that was forming 
earlier. In a chain, each response produces the stimulus for 
the next response, but why should each producing response 
be made what keeps the chain together? The roots of this 
question go back to our first descriptions of operant reactions, 
where for the sake of simplicity the question was laid aside so 
that our analysis of behavior could get under way. We have 
now come to the point where the problem must be aired and 
dealt with. In so doing, we encounter a new principle which 
fills a key place in the jig-saw problem of behavior that we 
have been carefully assembling in the preceding pages. This 
principle, if we may name it shortly before explaining it, is 
that of secondary reinforcement. With its aid, we can make 
new and material progress in rounding out our picture of 
that extraordinary object of study, the behaving organism. 

Perhaps the best way to come at the principle of secondary 
reinforcement is by a review of the ways in which a stimulus 
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can affect behavior. If we look back over what has been said 
so far, we can summarize the functions of stimuli quite 
briefly. Stimuli may be (i) eliciting, as in respondent reac- 
tions; (2) discriminative, in that they "set the occasion" for 
the emission of an operant response; (3) reinforcing, as when 
a new Type S reflex is brought into being by the uncondi- 
tioned stimulus, or an emitted operant is strengthened by the 
presentation of a food pellet. Reinforcing stimuli in Type S 
reactions are correlated with others which then become ca- 
pable of eliciting the response. In Type R, the correlation is 
with a response, the contingency between response and re- 
inforcing stimulus serving to increase the emission frequency 
of that response. We have not spoken as yet of two other func- 
tions of stimuli, the drive-arousing and emotionalizing func- 
tions, but these will be considered in the next two chapters. 
To the three we have enumerated, however, we wish to add 
as a fourth that of secondary reinforcement, at times called 
conditioned reinforcement. 

A New Function of the Stimulus 

We are led to the realization of a new kind of stimulus 
control over behavior, by inquiring what the effect upon a 
non-reinforcing stimulus may be when it accompanies a re- 
inforcing one. When, after many such associations, we apply 
the erstwhile non-reinforcing stimulus to the organism, we 
look to see whether its action has undergone a change. The 
appropriate experiment can be easily arranged. But before 
any laboratory trial is made at all, a reconsideration of al- 
ready well-known experiments and observations leads us to 
the hunch that a stimulus which is not originally a reinforc- 
ing one (or which is not f as we often say, a "primary re- 
inforcement") can become reinforcing through repeated 
association with one that is. That is, reinforcing power may 
be acquired by a stimulus through being present when an 
original reinforcement is given. 

Working from this same hunch, scientists have now estab- 
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lished its truth. In the discussion of the principle that fol- 
lows, we shall report some of the evidence they marshalled for 
it; we shall consider some of its recent experimental exten- 
sions; and we shall indicate, at the same time, its multifold 
implications for human and animal behavior. 

Secondary Reinforcement of Type S Reactions 
You may recall that in Chapter 2 we mentioned reports, 
from Pavlov's laboratory, of higher-order conditioning. In 
those experiments, conditioned stimuli were used as reinforce- 
ments for others in the absence of the original unconditioned 
stimulus. For example, a buzzer conditioned to elicit saliva- 
tion was then paired with a visual stimulus like a black 
square, and presumably sufficed to establish the square as a 
conditioned eliciting stimulus of the second order although 
the latter was never once reinforced by pairing with the food. 
The crucial point about these experiments was that a stimu- 
lus seemed able to acquire reinforcing power through con- 
ditioning, and thereafter to act independently to condition 
another reflex. Despite the admitted weakness and instability 
of this higher-order conditioning, the phenomenon aroused 
much discussion among psychologists. Behavior theorists ap- 
pealed to the principle in explaining those complex acts of 
everyday life which are neither initiated nor sustained by 
unconditioned stimuli like food, water, and sex. Little more 
was done experimentally with Type S conditioned reinforce- 
ment after Pavlov, but it is well that the principle was not 
lost sight of, because a parallel for it was found in operant 
behavior one that assumed even greater importance. 

Secondary Reinforcement of Type R Reactions 

A stimulus is said to be reinforcing if it possesses the ca- 
pacity to increase or maintain reflex strength above operant 
level. In operant conditioning, it must be able to raise the 
strength of the response producing it, or be able to sustain 
the strength to some degree in the face of extinction. (In re- 
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spondent conditioning, it must increase the eliciting strength 
of a stimulus paired with it.) Here, then, is the criterion to 
be met by a secondary reinforcer: it must, through condition- 
ing, have acquired the power to condition. Moreover, we may 

recognize at once that, as 
with any conditioned stim- 
ulus, a secondary reinforcer 
used repeatedly without 
further association with 
primary reinforcement will 
extinguish, that is, lose its 
power to reinforce. 

We may summon a few 
introductory examples to 
demonstrate the operation 
of secondary reinforce- 
ment. Remembering that 
the signs of an effective 
operant reinforcer are an 
increase in emission rate, 
or the retardation of ex- 
tinction, the following ex- 
periments illustrate botli 
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FIG. 52. Conditioning of the bar- 
pressing response by rats with sec- 
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they were blocked off by a sheet of glass. These chicks were 
accustomed to rice as food, and in the experiment the 
visual stimulus alone acted as a secondary reinforcement for 
strengthening the running response. Williams (1929) found 
that rats, daily trained in using the white compartment of a 
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discrimination box as a route to food, were able to master a 
fourteen-unit T-maze with no other terminal reinforcement 
than the white compartment itself. A similar result has more 
recently been reported by Saltzman (1949); and other investi- 
gators have shown that animals will learn mazes or other re- 
sponses if the reinforcement provided is simply a return to 
their living-cage homes. Bruce (1932) showed that the extinc- 
tion of a simple maze performance was slower when food was 
present though inaccessible (behind wire mesh) at the end of 
the run than when no food at all was present. Bugelski (1938) 
got slower extinction of bar-pressing in rats when each re- 
sponse produced an auditory click which had regularly ac- 
companied food-reinforced bar-pressing during conditioning, 
than when the extinction responses produced neither click 
nor food. 

Still other examples of secondary reinforcement come to 
light if we suddenly bring to bear upon an extinguishing re- 
sponse a stimulus which had previously been correlated with 
reinforcement. Thus, if bar-pressing is being extinguished in 
a rat, and when the response rate has fallen quite low we re- 
connect the empty magazine so that a response now produces 
the accustomed clicks (but still no food), the extinction curve 
shows a burst of responses. This effect was mentioned in the 
discussion of chaining in the last chapter (page 202), and it 
is a neat case of secondary reinforcement. Notterman (1950), 
using rats as subjects in a runway experiment, during the 
training correlated a light with food as the terminal rein- 
forcement for runs. Later, during extinction of the running 
response, he re-introduced the light at the end of runs, and 
found that it acted as a secondary reinforcer since the run- 
ning speed quickened significantly and durably. Notterman 
ran control animals to prove that it was not just any stimu- 
lus change at the end of a run which could be expected to 
produce this effect, but that the stimulus used as a secondary 
reinforcer needed to have a history of association with primary 
reinforcement. 
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S^ and S r 

The reader who has trained himself by this time to raise 
his own questions about details of behavior, will not have 
failed to ask what we mean by "a stimulus correlated with 
reinforcement" in describing an S r . (We shall from now on 
use S r to denote a secondary or conditioned reinforcement; 
the symbol S D , for a discriminative stimulus, is already fa- 
miliar.) What kind of a "correlation" must it be: what tempo- 
ral relation to the reinforcement, and what sort of association 
with it? That mere contiguity with reinforcement is not suf- 
ficient to make an S r out of a neutral stimulus is shown by 
an experiment of Schoenfeld, Antonitis, and Bersh (19503). 
These men trained animals to come to the cage's food tray 
at the sound of a pellet falling into the tray. After an animal 
had seized his pellet, and while he was eating it ( which took 
about ten seconds on the average), a light was turned on for 
one second. One hundred such light-pellet associations were 
given each animal, and then the possible S r value of the light 
was tested by seeing whether bar-pressing could be condi- 
tioned by giving only the light as reinforcement. The net 
result of these associations was zero, and since this type of 
correlation did not work, it became clear that one had to dis- 
cover just what kinds of association could give rise to a sec- 
ondary reinforcer. Though not so plentiful as we would wish, 
all the evidence we have so far points in the same direction. 
In order to act as an S r for any response, a stimulus must have 
status as an S D for some response. 

The foregoing sentence, standing alone, is not easy to grasp 
right off, and three points of explanation probably ought to 
be made. 

1. From the viewpoint of procedure, the training condi- 
tions for obtaining an S r are the same as those for obtaining 
an S D . We must first make an S D of the stimulus we wish to 
use later as an S r . 

2. The response for which the stimulus is specifically used 



SECONDARY REINFORCEMENT 237 

as S D is not the only one for which it can be used as an S r . For 
example, in a chain of heterogeneous reflexes, a response 
which produces the S D for the next response is thereby 
strengthened, that is, the S D for the next response in the chain 
acts as an S r on the response producing it even though that 
response is different from the next. It is worth recalling here 
that in Skinner's original conditioning experiments the same 
feature appeared. His rats were first trained to come to a tray 
for a pellet at the sound (S D ) of the discharging magazine. 
After being well accustomed to this reaction, the bar was 
introduced, and pressing was conditioned by the magazine 
sound which the rat himself produced by his pressing. In 
these instances, we see the stimulus acting as a reinforcing 
and a discriminative one at the same time. We shall shortly 
return to this matter in discussing how chains are cemented 
together and the "generality of S r ." 

3. The equality and interchangeability of the S D and S r 
values of a stimulus are suggested by an experiment of Dins- 
moor (1950). This worker gave several groups of rats the same 
amount of light-dark discrimination training, using the bar- 
pressing response and food reinforcement. A series of extinc- 
tion sessions followed in which the S D established during 
training was used discriminatively (i.e., preceded the re- 
sponge, and "set the occasion" for it) with one group of sub- 
jects, and reinforcingly (i.e., was produced by the bar-pressing 
responses) with another group. No difference in extinction 
responding occurred between these two groups. Moreover, 
in continued extinction sessions, the procedures were inter- 
changed, so that for one group the stimulus hitherto used 
discriminatively was now used reinforcingly, and vice versa 
for the other group. This interchange of stimulus functions 
did not produce any differences in the performances of the 
groups. Apparently, establishing a stimulus as an S D estab- 
lishes it equally as an S r , and extinguishing its power to func- 
tion in one way also subtracts, in like degree, from its power 
to function the other way. This conclusion, tentatively drawn 
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by the investigator, leads us to wonder how far the exact 
equality between the S D and S r functions of a stimulus may 
extend. Dinsmoor's study points up the question and pro- 
vides a basis for further work. 

Not only are the S D and S r values of a stimulus closely in- 
terrelated, but the same stimulus can, as in chaining, exercise 
both functions at the same time. What, then, is the reason for 
continuing the distinction between S D and S r as "two func- 
tions'? The reason lies, of course, in the way the experi- 
menter is using the stimulus, that is, whether he wishes it, 
in relation to some selected response, to act discriminatively 
or reinforcingly. Once a stimulus has been given a measure of 
control over behavior through correlation with reinforce- 
ment, that control may be used in either S D or S r fashion. 

That the distinction between stimulus functions on the 
basis of use is important to retain, can be gathered from the 
following example of how the two uses may lead to different 
results. Suppose we have established a stimulus as an S D for 
approaching and nosing the food tray in the cage. After- 
wards, desiring to test its controlling power, we introduce the 
bar into the cage for the first time in the animal's experience. 
If we decide to test the stimulus as S r , we present it after each 
pressing and observe that the response rate goes up, subse- 
quently giving an extinction curve (since primary reinforce- 
ment is not used to bolster the S r 's power). But, if we decide 
to use the stimulus as an S D for bar-pressing, and somehow 
manage to have it come on before operant-level pressings, we 
find no increase in rate of pressing, but rather an increase in 
the rate of the response which the stimulus followed. In short, 
where we deal with a new response having no previous his- 
tory of reinforcement, the S r function can have a specific ef- 
fect upon the response, whereas the S D function cannot. This 
difference in the outcome of the two uses shows that we are 
still justified in separately discussing the two stimulus func- 
tions. After conditioning, a stimulus may perhaps never be 
divested of its dual endowment, but it is necessary in analysis 
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to isolate the way it is acting in a given situation just as a 
material body may possess both mass and extent at the same 
time, but we do not hesitate to separate the two aspects for 
study and use. 

More about Chaining 

We are now in a position to say a bit more about the topic 
of the preceding chapter. A chain of reflexes is built up by 
having each response produce the S D for the next response; 
it is held together by the fact that each S D acts as an S r for the 
response producing it. To indicate the double action of each 
stimulus, we can give it a double superscript, as S r D , placing 
the r before the D because its reference is to the prior re- 
sponse, while the D has reference to the following response. 
The general notation for a segment of a chain would then 
be like this: 

.... -^S T ' R *-s r ' R -*- 

If we number the terms in the formula to correspond to the 
sequence in which they are built into the chain, we get: 

in which S r , R , and S Do are, respectively, the final reinforce- 
ment, final response, and final discriminative stimulus in the 
chain; in which S r * is the first secondary reinforcement to be 
"hooked in"; and in which (s) is the operant-level source 
of the remotest antecedent response in the chain, R n , to the 
remotest discriminative stimulus S Dn . 

The preceding chapter also emphasized that the S r D terms 
in a chain may be either exteroceptive or proprioceptive 
stimuli. Psychologists have for a long time been interested 
in the smooth flow and continuity of reflex chains, and 
have been especially impressed by chains based on pro- 
prioceptive cues which are hidden from the experimenter's 
view. In describing how chains are maintained and carried 
through, many writers have used such a term as set. Organisms 
were thought of as becoming "set for" a task by way of a 
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"preparatory set/' and as being guided or directed in execut- 
ing a task by a "continuing set." The term itself was never 
well defined but the observations upon which it was based 
are to be analyzed from the standpoint of chaining. The in- 
itiation of chains by discriminative stimuli, and the sustain- 
ing of a sequence by response-produced reinforcing stimuli, 
are at bottom the real meaning to be attached to "sets," both 
preparatory and continuing. 

In reflecting on the ways in which response chains may be 
disturbed, we note that there are two. One is that of cutting 
or breaking the chain at some point by withholding the next 
S rD in short, by extinction of all segments up to that point. 
A second is that of a response producing the "wrong" S D , that 
is, not the one necessary for going on with the rest of the 
chain. This is an occasion for "confusion" or "interruption 
of the set," so that the completion of the chain is diverted or 
slowed down, or is temporarily halted while the organism 
"readjusts," "takes stock of the situation," or "finds another 
way" to resume the interrupted activity. A hound intently 
following his quarry through the brush will take some time 
to regain his composure and return to the trail if he unex- 
pectedly pokes his nose toward a porcupine or a snake. A 
man's sleepy-eyed morning routine of washing is quickly 
alerted when, after drowsily squeezing a tube, he tastes shav- 
ing cream on his toothbrush; and parents are familiar with 
the outrage of the child who detects a mistake in an oft-re- 
peated songas when the weary parent errs: "Simple Simon 
met a pie-man going home to bed. . . ." In these, as in many 
other cases of interrupted chains, the behavior does not stop 
completely, but is apt to take, at least for a while, another 
direction. 

The Generality of S r 

In our treatment of secondary reinforcement so far, we 
have made use of a fact which may now be given the explicit 
emphasis it deserves. It is this fact that makes the principle of 
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secondary reinforcement so significant for complex activities 
like the social behavior of human beings. Here it is: once 
established, a secondary reinforcement can strengthen other 
responses than that used during its original establishment, 
and can do so under other motives than that prevailing dur- 
ing the original training. This means that an S r is not limited 
in its range of applicability, and that it provides us with a 
"purely general" reinforcement for use under any conditions 
we desire. We have already seen that a new response can be 
conditioned by an S r , and we are now adding the important 
observation that this may be done even if the motive is 
shifted (e.g., Estes, 19493, ig4gb). On the animal level, for 
example, we can correlate a stimulus with water-reinforce- 
ment of bar-pressing by thirsty and not-hungry rats, and then 
use the stimulus as an S r to condition chain-pulling when the 
animals are hungry but not thirsty. On the human level, the 
words "Well done!", or the approval of other people, or 
money, may act as secondary reinforcers for a variety of be- 
haviors under any prevailing motive. 

The fact of S r independence should not make us forget, 
however, that (a) an S r extinguishes or loses its power with 
repeated application if separated from primary reinforce- 
ment and (b) as with any reinforcement, some drive must be 
present for the S r to be effective. We need not always pause 
to identify either the primary reinforcements that continue 
to support the power of S r 's, or the prevailing drive under 
which the S r is acting. Always, however, there is implied the 
assumption that both are present. In a broad way, you may 
think of the supporting primary reinforcements as being such 
things as our daily food and drink, or relief of pain or anxi- 
ety; and the operative drives as occasional hunger and thirst, 
sex drive, and aversive tensions or fears. In a hypothetical 
environment shorn of primary reinforcements, or in which 
organisms are devoid of motives, S r 's would either extinguish 
completely or never act. 

Students sometimes offer examples of a social stimulus ap- 
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parently separated from its primary reinforcement which 
nevertheless indefinitely retains its control over behavior. 
Mother is no longer, seemingly, related to food, comfort, and 
so on, yet she remains a strong controlling stimulus. But we 
may note that if mother is no longer present, she cannot 
extinguish as an S r ; if she is present, she often remains as- 
sociated with primary reinforcement (like "Thanksgiving 
dinner with the folks"), or she generalizes with other persons 
who are so associated. Seeming "exceptions" are dangerous 
when they condense a whole human life in one sentence, not 
offering sufficient data for diagnosis or discussion. We ought 
to prefer to remain on firmer ground, using verified prin- 
ciples with proper safeguards against over-extrapolation. The 
principle of secondary reinforcement, and the fact of its in- 
dependence, will enable us to take further steps in analyzing 
complex behavior, and we shall find ourselves employing it 
in the remaining pages of this book. 

One final word on generality. Since the S r function of a 
stimulus is closely related to its S D function, we might hy- 
pothesize that the latter should show, to some degree at least, 
the same generality as the former. That is to say, we might 
ask whether an S D once established for a given response under 
a given drive, is capable of acting as an S D for other responses 
under other drives. Again the answer seems to be Yes, but 
subject to the qualification that the response to which the 
S D is transferred must already have been conditioned. Thus, 
Walker (1942) took an S D previously associated with a run- 
ning response and presented it during the extinction of 
bar-pressing. The stimulus had never been related to bar- 
pressing, but bar-pressing had been conditioned beforehand 
with the same reinforcement (food) as had the running. The 
result was an increased rate of pressing during the applica- 
tion of the S D . Apparently, an S D which denotes an "occasion" 
for reinforcement of one response can also be an "occasion" 
for a second response which has been similarly reinforced but 
never before connected with that S D . Data corroborating this 
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finding have been reported by Estes (1943, 1948). Not so 
definitely answered is the experimental question whether an 
S D may be transferred to another response being made under 
another drive. The proviso would once more hold that the 
response involved under the new drive must possess some con- 
ditioned strength, whether that strength was acquired under 
the new drive or the one prevailing when the S D was estab- 
lished. Clearly, the generality of S D combined with that of 
S r adds up to the possibility of greatly extended behavior 
variety and control. 

Some Parameters of S r 

What variables determine the strength of a secondary re- 
inforcement? Of the many we can guess at, experimental 
information is thus far available on only a few. This area of 
research is growing, however, and we are sure to increase our 
knowledge of it greatly in the near future. 

1. The strength of a secondary reinforcer may be expected 
to depend, for one thing, on the number of times it was cor- 
related with a primary reinforcement. Bersh (1950) has found 
this to be so experimentally. Using groups of rats, he paired a 
light with the dropping of a pellet into the food tray, varying 
the number of these pairings from o to 120 with different 
groups of subjects. When he later tested the light as an S r for 
reconditioning the response of bar-pressing, which had been 
previously conditioned and extinguished, he found that its 
strength was greater when its number of correlations with 
reinforcement had been greater, and that the strength ap- 
proached an asymptote beyond which an increased number 
of associations with reinforcement did not carry it. 

2. Bersh (1950) has also shown that the temporal relation 
between the S r and the primary reinforcement with which 
it is paired during training is important in determining the 
reinforcing value acquired by the S r . In one experiment, a 
light was arranged to begin o, i/, i, 2, 4, and 10 seconds be- 
fore the dropping of a pellet into the tray of the rat's living 
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cage. When the S r was later used to condition bar-pressing, 
the graph of the amount of conditioning obtained rose to a 
maximum for the S r which had had an interval of about one 
second between its onset and primary reinforcement during 
training. One second is the optimal interval, and intervals 

80 h- 



60 



U. 
O 

5 

03 

2. 
ID 
Z 



40 



20 






20 



40 60 

NUMBER OF S N -S R 



80 
PAIRINGS 



100 



120 



FIG. 53. The effectiveness of a secondary reinforcer as a function of 
the number of times it \vas paired with a primary reinforcer. S N is the 
neutral stimulus being conditioned as a secondary reinforcer; S R is the 
primary reiniorcer with which it is paired. (After Bersh, 1950.) 



100 



80 



fe 60 
a: 

Ltl 

co 40 

5 

ID 



20 



1 



S M -S R INTERVAL IN SECONDS 



10 



FIG. 54. The effect of the time interval separating S N and S R (defined 
as in Fig. 53) upon the power acquired by S N to act as a secondary re- 
inforcer. /After Bersh, 1950.) 



SECONDARY REINFORCEMENT 245 

either longer or shorter are not so effective in establishing a 
stimulus as an S r . Another experiment along these lines has 
been reported by Jenkins (1950). 

3. Pursuing the idea that an S r is established through 
training as an S D , Notterman (1950) did an experiment in 
which the discrimination training given a stimulus was 
varied. He used groups of rats as subjects, a runway as ap- 
paratus, and a light as the S D correlated with food reinforce- 
ment while absence-of-light was S A on non-reinforced runs. 
Each group of rats received the same number of S D trials 
(fifty in all over a six-day period), but for each group the 
number of interspersed S A trials varied. One group got no 
S A trials at all; another, 10 S A trials; another, 25; another, 50; 
another, 100. After training was over, each group was extin- 
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guished on the runway for 94 trials, each trial being given in 
the absence of light (S A ) and without reinforcement. The 
light was then introduced as S r from the gsth trial on to the 
105th which was the last, and the amount of facilitation of 
running produced by the light was compared for the several 
groups. There was a systematic increase in reinforcing power 
of the S r which paralleled the increasing number of S A trials 
it had been given during its discriminative training. 

4. We may guess, in the absence to date of experimental 
information, that the amount of primary reinforcement asso- 
ciated with a secondary reinforcer may also be of significance 
in determining its strength. Other things equal, if the pri- 
mary reinforcement is small, then the accompanying stimulus 
may become only mildly reinforcing; if it is large, the sec- 
ondary reinforcement may gain more strength. This relation- 
ship is suggested through analogy with basic conditioning 

(page 74). 

5. A very tempting view is that the schedule of association 
with primary reinforcement has its effect on secondary-rein- 
forcement strength. Any schedule of primary reinforcement, 
whether regular or periodic or aperiodic, may be associated 
with any schedule of presenting the potential S r . It would be 
surprising if all the possible combinations of schedule had the 
same effect on the strength of the secondary reinforcer, but 
whatever the effect we should be glad to know it. 

6. Again, we may guess that the effectiveness of an S r 
depends upon the delay with which it is applied to the re- 
sponse. This would be analogous to what has been said about 
the "delay-of-reinforcement gradient" for primary reinforcers 
(page 208). Exact measurement of the result of delaying an 
S r has not yet, however, been made experimentally. 

7. There is also to be considered the nature of the organ- 
ism as a determinant of the length and complexity of chains 
in which secondary reinforcement figures. The organism 
dealt with in an experiment is one of the conditions of that 
experiment. He will determine what stimuli may be used at 
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all for him by reason of his sensory capacities. He will require 
this or that number of reinforcements to develop a reaction 
strength which another species might achieve with half that 
number. His potentiality for acquiring a long chain of re- 
flexes may be quite limited in comparison with an organism 
far above him in the evolutionary scale. The resistance to 
extinction of a secondary reinforcement -its longevity may 
be enormously different from one level of organism to an- 
other: it may endure for life in a man, but expire after several 
responses in a chicken. 

"Token Rewards" and "Sub-Goals" 

In analyzing response chains, we noted, among other 
things, that each S D in the chain acts as an S r for the preceding 
response. We spoke of one response as producing the stimulus 
in the presence of which the next response is called for. Now 
suppose that we alter the situation and try to get a chain in 
which the next-called-) c or response is to manipulate or do 
something with the S D produced by the preceding response* 
What success would we have, and what could we learn from 
such an experiment? 

The experiment, we find, has been done successfully sev- 
eral times, and with a variety of animals ranging from the rat 
to the chimpanzee. By way of illustration, however, we shall 
limit ourselves here to studies of ape behavior, since these 
have been clear, dramatic, and highly suggestive of human 
conduct. Wolfe (1936) trained chimpanzees to insert small 
discs (poker chips and brass slugs) into the slot of a vending 
machine which automatically rewarded each response by 
delivering a grape. Subsequently, the animals quickly learned 
to move the lever of a "work apparatus" which provided discs 
that could be exchanged for grapes in the vending machine. 
In a number of variations of the experiment, they demon- 
strated their ability (i) to discriminate food tokens (white 
discs) from non-food tokens (brass discs); (2) to select blue 
tokens in preference to white tokens, and white in preference 



248 PRINCIPLES OF PSYCHOLOGY 

to brass, when the blue procured two grapes, the white pro- 
cured one, and the brass none at all; and (3) to select tokens 
in accordance with their prevailing needsfor example, black 
(food-getting) tokens were chosen when the animals were 
hungry, and yellow (water-getting) tokens were chosen when 
they were thirsty. Even when the disc-for-grape exchange was 
delayed, the subjects stayed at work obtaining discs for a con- 
siderable time and saved them until they could be used. Discs 
that could be 'traded in' immediately had somewhat greater 
incentive value than those which could not be exchanged 
until some time later; but the animals would readily lay in a 
supply of twenty or thirty discs for future use. Cowles (1937) 
confirmed many of Wolfe's observations and added new data. 
Most strikingly, he showed that apes were able to form a 
number of discriminations, some of them quite complicated, 
solely on the basis of token reward. Thus, they were able to 
master color, pattern and size discriminations in a standard 
discriminative situation, after which they carried their tokens 
to another room and exchanged them for raisins. 

There is no reason why you should not have been able to 
predict in advance the outcome of these studies, since you are 
already equipped with ample background. The first stage of 
Wolfe's experiment involved a straightforward operant, disc 
insertion into a machine for food-reinforcement, that is 
analogous to the rat's bar-pressing. (This applies also to a 
host of studies in which a tool or a "manipulandum" is used 
to obtain foodfor example, string-pulling, use of rakes, box- 
stacking, and so on.) In the preferential selection of blue 

(two-grape) tokens, the reflex S D (blue disc) R has the 

advantage of a greater amount of reinforcement than the S D 

(white disc) R reflex, just as if there were two bars in a 

cage, with a rat learning to press one predominantly because 
it gave more or larger pellets than the other. Related to this 
choice is also the action of the negatively reinforcing charac- 
ter of the "hard way" (page 177), with the blue disc again 
benefiting because it gives more-return-for-less-work. In the 
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same situation, the brass disc response is extinguished as the 
usual result of non-reinforcement, although occasional selec- 
tion of these discs would be expected (and was observed) on 
the basis of stimulus generalization. In both Wolfe's and 
Cowles' studies, the discs were employed in the chain as sec- 
ondary reinforcements for conditioning other responses. The 
fact that, after procurement, the next response was a manipu- 
lation of the S r D (the disc) is, on second thought, actually no 

different from the S D (visual pellet) R (seizing) in the chain 

that follows bar-pressing. The "hoarding" of discs is itself 
learned through the ultimate, though delayed, reinforcement 
provided when the discs can once again be exchanged for 
food. And so on. There is apparently no aspect of this re- 
search which taxes unduly the principles with which we are 
familiar. 

Experiments of this sort have also been called "sub-goal" 
studies. Both ''sub-goal" and "token-reward" are ways of de- 
noting the action of secondary reinforcements, and we should 
not permit ourselves to be disturbed by synonyms. "Token- 
rewards" were so called merely to distinguish them from 
"primary" reinforcement. The chimpanzee experiments 
arouse our interest because they come so close to our own 
social and economic behavior. In human society, money and 
other secondary reinforcements such as "prestige" and "com- 
munity approval" assume, through training, the status of 
generalized rewards that have potency for practically every- 
thing we do. Chimpanzees, like men, will apparently do 
many things for money. 

Secondary Negative Reinforcement 

While we have spoken thus far of conditioned reinforcers 
which act positively, we must not overlook the possibility of 
conditioning a stimulus as a secondary negative reinforce- 
ment. In the experimental study of anxiety (see Chapter 9), 
an S D is arranged to precede by a fixed time the onset of an 
inescapable noxious (negatively reinforcing) stimulus like an 
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electric shock. The result, briefly stated, is that bar-pressing 
in the interval-before-shock is depressed or halted until the 
shock is past. We ask whether the S D in this situation be- 
comes also a secondary negative S r , and we can confirm the 
fact that it does by showing that an animal can be reinforced 
by its removal. 




FIG. 56. An apparatus for studying escape from a secondary negative 
reinforcer. The left compartment is painted white, and contains a floor 
grid for delivering electric shocks. The right compartment is painted 
black. The animal can escape from the left to the right by operating a 
door-opening mechanism in the wall between. (From Miller, 1948.) 

Evidence for such an effect has been obtained by several 
investigators. Hefferline (1950) presented a series of faint 
clicking sounds regularly in advance of a strong light from 
which rats could escape by bar-pressing. Whenever an animal 
responded during the S D period, the clicking was stopped 
and, subsequently, no light was presented. After long-con- 
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tinued combination of clicking-followed-by-light, it became 
clear that the bar-pressing response could be kept going for 
a time solely because it cut out the clicks. This is related to 
an earlier study by Mowrer and Lamoreaux (1942) in which 
a buzzer was sounded for six seconds prior to an electric shock 
which could be eliminated when the subjects (rats) ran from 
one half of an experimental chamber to the other. If the rat 
made the running response during the six-second period, the 
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FIG. 57. After considerable shocking in the left (white) compart- 
ment of the box in Figure 56, page 250 without any opportunity to escape, 
the wheel was made functional, and the animals were permitted to escape 
to the right (black) compartment by rotating the wheel to open the 
door. The curve shows the average latency changes of the wheel-turning 
response on the first 16 trials. (After Miller, 1948.) 

sound was discontinued and the shock was not given; if he 
did not respond during this period, the buzzer was continued, 
with the shock added, until the escape response occurred. 
Ten days of buzzer-shock combination, with ten combina- 
tions per day, were sufficient to establish the negatively rein- 
forcing effect of the buzzerthat is, to strengthen the running 
response through the termination of the sound. In a later 
experiment, Mowrer and Lamoreaux (1946) demonstrated 
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the same effect by conditioning a buzzer-terminating response 
(jumping into the air) that was distinct from the running 
response that ended the shock. Still more recently, Miller 
(1948) showed that both a wheel-rotating and a bar-pressing 
response could be strengthened when the only reinforcement 
was escape from a white box (termination of a negative S r ) 
in which they had formerly been shocked. 
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Fig. 58. After the wheel turning response (see Figure 57, page 251) was 
learned, the wheel was made non-functional and the new response of 
bar-pressing to open the door was substituted. The white compartment 
still retained its negative character, since the animal stopped rotating 
the wheel and acquired the bar-pressing response as shown in this curve. 
(After Miller, 1948.) 

In this connection, we may cite two observations made by 
Wolfe in the course of his token-reward studies. One of his 
apes, accustomed to the use of light-blue tokens to obtain 
'activity privileges' (return to living-quarters or play with the 
experimenter), twice employed the discs in what was appar- 
ently an attempt to escape from the test situation: once when 
a white rat was introduced into the experimental room, and 
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once when a photographer appeared on the scene to take 
pictures of the animal's behavior! 

One of the commonest appearances of secondary negative 
reinforcement is in the reflex chains associated with the mov- 
ing-about of an organism in its environment. Locomotor 
responses which bring an animal (or human being) into the 
presence of S D 's which have been correlated with painful or 
noxious stimuli (primary negative reinforcements) are de- 
pressed or weakened, so that he seems to avoid those situa- 
tions, staying in 'safe* areas. Moreover, an experiment by 
Bugelski and Miller (1938) showed that rats, placed in a run- 
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FIG. 59. The tendency to move away from a place where negative 
reinforcement was applied. The strength of the tendency is measured 
in terms of the time taken by the animal to get started when placed 
at three different distances from the point at which he had been shocked. 
The experimenters refer to this as a "spatial gradient." (After Bugelski 
and Miller, 1938.) 

way at different distances from the point at which they had 
received electric shocks, ran away with a speed that was pro- 
portional to the nearness of the punishment area the farther 
from the negative S r the slower they ran. 

We can observe this phenomenon in many other forms of 
behavior. It is seen especially in the mastery of those skills 
where a slip means disaster. A choking mouthful of water 
keeps the beginning swimmer's head up and mouth closed 
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for a long time afterward; and the 'no-hands' cyclist quickly 
seizes the handle bars when a loss of equilibrium is threat- 
ened. 'Safe' places are those in which positive, or at least no 
negative, secondary reinforcements are found; and they are 
'preferred' (the organism spends most of his time in them) 
to areas in which the S D 's have become negative reinforcers 
through association with noxious stimuli. 'Home' is such a 
place to many persons; whereas strange places may be 
crowded with unknown and untrustworthy S D 's, foretelling 
unpredictable, even fearsome, events. What we call 'home- 
sickness' may be in good part the absence from one's environ- 
ment of accustomed, and positive, S r 's. A child raised by 
much-visiting and much-traveling parents may be 'at home' 
anywhere, especially if his parents are with him. Not unlike 
this, is the 'hospital ids' reported by nurses and physicians in 
patients of long residence in a hospital who are dejected by 
the prospect of discharge, and who may develop a relapse, 
real or feigned, that prolongs their stay in a friendly, protec- 
tive environment. 

Secondary Reinforcement and "Feelings" 

Among the topics which psychologists, ancient and mod- 
ern, have traditionally discussed is that of jeelings. This topic 
can take one into many blind alleys, especially if one were to 
follow the everyday uses of the word. In conversation, the 
word has no rigorous meaning. We use it in such statements 
as / feel reluctant., My feeling is that one should compromise, 
That feels cold, I feel sad, I feel that I must, and Goya's etch- 
ings have such feeling! There are thousands of phrases in 
which we employ either the verb or the noun, and thoughtful 
students have long ago agreed that these usages cannot pos- 
sibly delineate a single problem or field of investigation. 

Progress in finding a psychological meaning for the term 
began in the last century and has continued into the present. 
In one famous attempt to delimit the field, Wilhelm Wundt 
(1896) proposed a 'tri-dimensional' theory. All true feelings 
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were to be described in terms of three coordinates: degree of 
subjective pleasantness-unpleasantness; tension-relaxation; 
and excitement-calm. Thus, any feeling could be adequately 
described by reference to its place along each of these con- 
tinua. One feeling might be pleasant, relaxing, and calming; 
another might be unpleasant, tensing, and exciting; and 
others would portray still different combinations. This view 
carried enough prestige to initiate many experimental at- 
tempts to find behavioral counterparts (changes in blood- 
pressure, pulse-rate, respiration, and so on) of each of the 
alleged dimensions. A great deal of conflicting data resulted, 
and the whole enterprise is now generally conceded to have 
been relatively unproductive. Titchener (1908) offered a 
pared-down introspective account of feelings which elim- 
inated two of Wundt's dimensions, retaining only pleasant- 
ness-unpleasantness, but the search for bodily correlates of 
this dimension was still dogged by failure. Beebe-Center 
(1932), a present-day authority in this field, finds that no 
single relation between pleasantness-unpleasantness and 
either respondent or operant behavior has been accepted 
without question. The best that he can tell us is that all 
theories of the relation between feeling and 'locomotor' (his 
equivalent of operant) response "depend on the correlation 
of pleasantness with seeking, approach, acquisition; and the 
correlation of unpleasantness with avoidance, withdrawal, 
rejection." 

The bearing of reinforcement upon such verbalizings as 
"pleasant," "unpleasant," and their synonyms has not been 
fully explored, but is likely to prove the most profitable 
approach of all to this problem. Reinforcements appropriate 
to our "drives" (Chapter 9) commonly evoke "pleasant" or 
its equivalent as food when one is hungry; while non-rein- 
forcement, negative reinforcement, and the removal of rein- 
forcement apparently occasion "unpleasant." Even more to 
the point is the action of secondary reinforcers. Many foods 
are nourishing but not equally "pleasing" (Mother's cooking 
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is the best!). It would seem that the discriminative stimuli 
which foretell positive reinforcement raise within us "pleas- 
ant and joyful anticipation"; those foretelling negative rein- 
forcement cause "unpleasant and fearful anticipation" or 
"anxiety." But still we are faced with the task of isolating the 
S D 's that lead us to emit such words. Theorists have long 
argued that "feelings" are related, not to objects or other 
external S D 's, but to the effect they have upon us ("If a child 
is pleased by a new red toy, then both red and pleasantness 
are features of his experience, but it is the toy that is red and 
the child who is pleased" Beebe-Center). It may be that 
whenever we say an object is pleasant or unpleasant our state- 
ments are based upon intervening responses (of approach, 
withdrawal, or some other sort) which follow directly upon 
exteroceptive stimulation and themselves determine the overt 
spoken outcome. In such a case, "feelings" and "meanings" 
would have much in common, and the problem of reported 
"feelings" would be similar to the problem of reported "emo- 
tions" (see Chapter 10). This parallel is supported by the fact 
that reaction times of the verbal responses or judgments 
'pleasant* and 'unpleasant' are generally slower than ordinary 
reaction times a finding that would be expected on the basis 
of the intervening response that serves as the S D for the 
judgment. 

The suggestion is strong that positive secondary reinforce- 
ment figures in the explanation of 'joys' like the joy of re- 
visiting 'the scenes of our childhood/ Apparently the source 
lies in the reinforcements connected with familiar places, 
people, things, and actions. The loss or absence of old sec- 
ondary reinforcers, on the other hand, seems often to be at 
the heart of our 'sorrows/ Homesickness; dejection at the 
death of a friend; regret at the passing of old landmarks; 
yearning for 'the snows of yester-year' do not these suggest 
the loss of S r - D 's? Indeed, the tie-up of secondary reinforce- 
ment and "feeling" seems to pervade our whole lives as social 
organisms. The process of socialization has to do with acquir 
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ing those modes and standards of behavior which are typical 
of the society in which the individual grows. Because all of 
our reinforcements occur in a given social environment, and 
because society often makes reinforcement contingent upon 
our doing and saying 'the proper things/ we may come to like 
and prefer as 'naturally pleasing* our own particular way of 
life, and this or that style of music or art. Even our group 
prejudices, our small talk, our ambitions, our religion, our 
special forms of virtue (and vice), appear to have their roots 
in this basic principle of human behavior. The very language 
we speak is an edifice reared upon secondary reinforcement. 
In a foreign land, we thrill to the sound of our native tongue; 
at home we enjoy the writer or speaker in whose verbal skill 
dwell many of our strongest secondary reinforcements. We 
find that poet satisfying who artfully sets up in us the verbal 
chains that he is using, so that his next word, and his next, 
come almost as the completions of what we were about to 
say! In the verbal interplay between writer and reader, be- 
tween speaker and listener, it is the writer's or speaker's art 
to start up in us those 'thoughts' which agree with his, bring- 
ing us together in the same responses, making us say the same 
things, or supply the same rhyme. Such an outcome is truly 
'pleasant/ even overpowering, because the poet and we have 
been speaking in the same way together, exchanging our 
'desires' and 'passions' and our tears in perfect harmony a 
relationship as near to the purely social as we can conceive, 
and as matchless an example of secondary reinforcement as 
man can create or find. 

Secondary Reinforcement in Social Behavior 

As implied above, a stimulus controlling behavior as an 
S D , S r , or S r D need not stem from the inanimate environment 
alone. Other organisms (or, the stimuli emanating from 
them) can act in these ways, too, and therein lies a fact of 
utmost significance to human as well as other biological life, 
l h^h*x/i'or may be described as behavior for which the 
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reinforcing or discriminative stimuli are, or have been, 
mediated by the behavior of another organism. By "me- 
diated" we mean "arising from, or in connection with," and 
there is no intention of straining the word's connotation. We 
would neither (i) include as social the delivery of a pellet by 
the experimenter to a bar-pressing ratit may be a social 
situation for the experimenter, but it is not for the rat; nor 
(2) exclude as non-social the behavior of the marooned sailor 
who speaks to himself or makes clothes out of skins, since 
both activities have been socially acquired. 

From birth on, social stimuli play a large part in the life 
of human beings. Many scientists, indeed, have thought that 
society itself has its origins in the protracted and utter de- 
pendence of the human infant. However that may be, 
parents, and especially the mother, are among the first sec- 
ondary reinforcers of a social sort to enter the infant's ken. 
Their discriminative and reinforcing potency are quickly 
established by their continual association with food, warmth, 
relief from pain, and the like. If, however, the child is reared 
by a nurse, then she becomes the ever-present secondary rein- 
forcement, and it is commonly seen that attachment to the 
nurse replaces that to the mother. Psychiatrists have pointed 
out that, in adolescence and before, the first sexually interest- 
ing objects may be the parents, brothers, or sisters. Within 
the relatively restricted social environment of the child, the 
few organisms who serve as the accompanying stimuli when 
reinforcement is given or withheld can acquire an over- 
spreading and life-long grip on his behavior. With increasing 
age, the child's widening excursions beyond the home pro- 
vide an increasing range of secondary reinforcers to control 
his reactions. School, friends, clubs, and related activities of 
all sortsthese push upon him the stimuli which are the ever- 
present signals and accompaniments of ultimate reinforce- 
ment, the S D 's and the S r 's which are the warp and woof of 
his life in society. 
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Although the theme of anxiety will be developed later (see 
Chapter 9), we should recognize in passing that social 
stimuli can also act as secondary negative reinforcers. As we 
shall see, the behavior which occurs in the interval between 
an S D and a negative reinforcement is characterized by a 
depression of operant activity and the onset of respondent 
changes. The laboratory study of anxiety (induced experi- 
mentally by using the S D -negative reinforcement sequence) 
has numerous implications for everyday affairs. A child raised 
in a rigid and over-disciplined home, will suffer many punish- 
ments through the prescriptions and prohibitions imposed 
upon him. The adolescent, having to cope with his newly 
acquired social status, encounters many pitfalls and rebuffs 
before he learns acceptable modes of behavior. In these and 
similar instances, persons are the appliers of punishment or 
emotion-arousing stimuli, and, by this association, themselves 
become secondary negative reinforcements. Through stimu- 
lus generalization, other persons may be included within this 
category. The result may be a depression of activity in their 
presence, avoidance of people, seclusiveness; in short, there 
may develop anti-social and maladjusted behavior so alarm- 
ing to the clinical psychologist or mental hygienist. Where 
external positive reinforcements of the secondary sort are 
radically reduced, and where negative reinforcements crowd 
too strongly upon a man, the final outcome may be complete 
'withdrawal from the world' as seen in the psychosis of 
schizophrenia. Even in the normal development of children 
it has long been remarked that age is accompanied by an 
increase in the sheer number of things feared. The six-month- 
old infant has few fears; the six-year-old has many. This is the 
upshot of increasing experience with negative reinforcement 
in an ever enlarging world, with the consequent multiplica- 
tion of S D 's for fear. Lower animals may learn most of their 
fears at the hands of their natural environment; man gets 
most of his at the heedless or unmerciful hands of his fellow 
creatures. 
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Taking Stock and Looking Ahead 

Far back in this book we said that human behavior is the 
final object of interest to most psychologists, as it is to the 
layman. In reaching our goal, the principle of secondary rein- 
forcement will be of great analytical assistance. When added 
to the other functions of stimuli, it gives us a powerful and 
indispensable tool for the solution of many vexing and ab- 
sorbing problems of human action. It will not escape the 
thoughtful student that the following points go far toward 
explaining the elaborate and ramified behavior of organisms 
high in the evolutionary scale, and of old organisms as against 
young. Given an undeveloped creature to whom few primary 
reinforcements may be relevant, the following facts allow us 
to augment our control over his maturing operant repertory 
in ever increasing fashion. 

1. A stimulus that occasions or accompanies a reinforce- 
ment acquires thereby reinforcing value of its own, and may 
be called a conditioned, secondary, or derived reinforcement. 
A secondary reinforcement may be extinguished when re- 
peatedly applied to a response for which there is no ultimate 
primary reinforcement. 

2. A secondary reinforcement is positive when the rein- 
forcement with which it is correlated is positive, and negative 
when the latter is negative. 

3. Once established, a secondary reinforcement is inde- 
pendent and non-specific; it will not only strengthen the 
same response which produced the original reinforcement, 
but it will also condition a new and unrelated response. 
Moreover, it will do so even in the presence of a different 
basic motive. 

4. Through generalization, many stimuli besides the one 
correlated with reinforcement acquire reinforcing value- 
positive or negative. This point was not stressed in the pres- 
ent chapter but it should require no elaboration here. 

Finally, it should be remembered that, in everyday life, 
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stimuli are not single, that responses are not all alike, and 
that learning is not all done under the same drive. Many 
stimuli are present when a response is conditioned, all be- 
coming discriminative and secondarily reinforcing; many 
responses are capable of obtaining the same reinforcement; 
and more than one drive (hunger, thirst, etc.) may be satisfied 
at various times by the same responses and in the presence of 
the same stimuli. Truly, the study of behavior is a lofty chal- 
lenge to scientific imagination and method! 

NOTES 

There is no book on secondary reinforcement to which we can send 
you for additional information. Hull, however, in his Principles of 
behavior (1943), has devoted a chapter to this important principle. 
(Almost any text in social or abnormal psychology will, of course, 
provide you with numerous examples of its operation.) You ought now, 
however, to be able to follow with little trouble the growing experi- 
mental literature in this area. Access to these reports may be gained 
by way of the Psychological Abstracts, a journal that contains short 
summaries of all articles and books assumed to have any interest what- 
ever for psychologists. 

In terms of the relation of behavior to the environment of an organ- 
ism, the present chapter takes us almost as far as we can go. No more 
than two functions of stimuli remain to be considered, and nothing 
new will be said about the basic processes of conditioning, extinction, 
and the like. In fact, if you have followed our discussion up to this point 
with moderate success, you possess most of the available tools for 
behavioral analysis. From now on, we shall be interested in the kind 
of environmental control that induces changes in the state of an or- 
ganismas when we deprive an animal of food or water. In our talk 
of motivation and emotion in the next two chapters, you will see that 
we do little more than shift the focus of our attention say, from 
reinforcement to some previously unmentioned conditions under which 
stimuli become reinforcing. In Chapter 11, we shall try to point out 
the direction one might profitably take in carrying the principles into 
the realm of human interaction. 
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IT is by the pleasure of exertion, and the pain of inexertion, that we are 
roused from that indolence, into which ... we otherwise might sink: 
as we are roused, in like manner, by the pleasure of food, and the pain 
of hunger, we take the aliment that is necessary for our individual 
sustenance; and though the mere aliment is, indeed, more important 
for life, it is not more important for happiness than the pleasure of 
activity which calls and forces us from slothful repose. 

Thomas Brown, Lectures on the Philosophy 
of the Human Mind, 18*3 



A New Line of Inquiry 

We have been occupied in exploring the principle of rein- 
forcement and the manner in which the environment con- 
trols organisms by way of stimuli. From this single starting 
point, we have been able to take large strides in understand- 
ing why men and lower animals behave as they do. Yet 
students of psychology, in past times and present, have felt or 
known that a description of behavior would be incomplete 
without taking into account another kind of controlling fac- 
tor which today we call motivation. 

Common experience reveals the existence of this factor so 
vividly that men everywhere have evolved a vocabulary and 
set of ideas for explaining and speaking of it. Growing up in 
a social community as we do, we are taught the prevailing 
words and concepts. These seem consequently, to be right, 
natural, and but common sense. Unhappily, there are few 
areas in psychology where popular notions contain a more 
alluring blend of correct and incorrect observations, of valid 
and biased thinking, of wise and foolish conclusions. Our 

202 
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first task, if we are to make progress in this new line of 
inquiry, is to begin aright. 

Ordinarily, our questions about motives occur in connec- 
tion with complex types of human interaction. We feel that 
unless we know the underlying motives we shall not be able 
to deal effectively with ourselves and others in the many 
important affairs of everyday life. Why does each person seek 
popularity? Why do men marry as they do, and whofn they 
do? Why do men fight and take pleasure in killing? Our 
analysis cannot start with such samples of behavior, but rather 
with fundamentals and experimentally verifiable data. The 
initial aim is to get a proper foundation for further building, 
not a blueprint for a superstructure which may require end- 
less revision, or may be totally useless, as the basic facts come 
to be known. Only in this way can we make the scientific 
progress we want and need. 

The Need for the Concept of Motivation 

Oddly enough, it is the very first step toward comprehend- 
ing motives (drives is a synonym) which often proves the 
hardest for the beginning student to take. What is demanded 
is that he lay aside long-favored personal opinions, and ex- 
amine de novo the reasons for believing that a science of 
behavior cannot get along without motivation! Only if there 
were observations of behavior not encompassed by the prin- 
ciples set forth in the earlier chapters, would we be required 
to formulate new concepts to deal with the data. The present 
chapter is, in fact, devoted to just such observations. Let us 
take the 'hunger drive* as an example. 

Neither rats nor men eat continuously, but at fairly def- 
inite intervals and in fairly definite amounts. This fact is 
banal enough to most persons, who would probably say that 
an organism eats when 'hungry* and stops when 'full/ But it 
may do us good to question the obvious, wherein at times 
have lain concealed some astonishing natural phenomena. 
So, with a suspicion that this is a phase of behavior worth 
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inspecting, we take to the laboratory; and soon our critical 
sense is prodded by further discoveries. We find that a bar- 
trained rat, left in his box or working-cage to obtain all of 
his meals by pressing, will eat in a fairly regular cycle and 
only so much at any one time. In a twenty-four-hour period, 
he may eat on ten or twelve occasions only, each time at a 
slow but constant rate. When given but one meal a day, 
always at the same hour, a curve like that in Figure 7 (page 
43) will be produced. When placed in the box, right after 
a meal, he will not eat at all. Moreover, an animal cannot be 
conditioned to salivate (Type S) or press a bar for food 
(Type R) unless hungry. 

These facts may leave you unmoved. Did we not assume all 
along that hunger or some other drive is required in experi- 
ments dealing with stimulus-controlled behavior? Yes, we 
did; but now we realize that we have on our hands a phenom- 
enon that deserves special consideration. For, trimmed down, 
our observations are that (i) depriving an animal of food is a 
way of increasing the strength of a conditioned reflex like 
bar-pressing; that concurrently, (2) many other reflexes rise 
in strength, such as reaching, seizing, and chewing; that (3) 
with sufficient intake of food (satiation), these reflexes drop 
in strength to zero; and that (4) food-deprivation is itself a 
prerequisite for using food as reinforcement that a rein- 
forcer is such by virtue of some operation that makes it act so. 

It is in this way that the need for the concept of motivation 
as a new variable in behavior arises. It is because responses 
can be controlled in other ways than by reinforcement, that 
a new descriptive term is called for and a new behavioral 
concept emerges. What shall we say about occurrences like 
feeding and fasting which affect reflex strength? How many 
types of such occurrences are there? How can we exercise 
control over them and, through them, over the organism? 
How many degrees of each can be set up? How can they be 
measured? What new experiments do they suggest, and what 
new knowledge will they generate? How are they related to 
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conditioning, extinction, discrimination, and the like? These 
questions and many similar ones soon lose their 'obvious' 
character. 

The Nature of Drives 

To the question What is drive?, we must now answer that 
drive is the name for a fact the fact that certain operations 
can be performed on an organism (for example, depriving it 
of food) that have an effect upon behavior which is different 
from that of other operations. Drive is not a thing, but simply 
a word we use to show our recognition that behavioral func- 
tions which may depend on reinforcement are also modifiable 
by another influence, one exerted by occurrences which do 
not involve reinforcement. 




TIME 

FIG. 60. Illustrating how varying drive levels during extinction act 
as a parameter of the extinction curve. The higher the drive, the greater 
the number of responses made and the more rapidly they are emitted. 

The observation that 'neither rats nor men eat continu- 
ously* requires an analysis that, after all, is rather distant 
from everyday levels of conversation. Words like motive and 
drive, drawn from the common vocabulary, do not explicitly 
denote the relation with which we are now concerned. Per- 
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haps the idea will become more acceptable as we examine it 
from different angles and express it in different ways, with a 
few experimental examples. 

i. Drive in extinction. It has been quite well established 
(e.g., Perin, 1942) that the process of extinction is influenced 
by the drive factor. The experimental design, urged by the 
initial observation that a bar-trained rat will not press when 
satiated, is simply this: Expose groups of animals to equal 
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FIG. 61. Experimental data on the relation of drive level to the 
number of responses made in extinction. Four groups of rats were con- 
ditioned to bar-pressing under the same conditions of hunger and given 
equal numbers of reinforcements. Subsequent extinction was carried 
out with each group under a different amount of deprivation: i, 3, 16, 
and 23 hours. Extinction was carried to a criterion of five minutes. The 
plot shows the average number of extinction responses for each group. 
(From Hull, 1943, as adapted from Perin, 1942.) 



amounts of training under equal deprivation, and then ex- 
tinguish the response under different lengths-of-deprivation 
to see whether the number of pressings they make co-varies 
with deprivation time at extinction. A fair number of experi- 
ments have been made on this question, with good agreement 
that deprivation time does act as an important variable. The 
idealized finding is shown in Figure 60 indicating that, in 
addition to the height of the cumulative response curve 
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reached, the early rate of emission is also affected. Such ex- 
periments give substance to the concept of motivation as a 
determinant of behavior supplementing reinforcement. 

Technically, we may 
say that drive, here 
specified in terms of 
deprivation time, is 
a "parameter" of ex- 
tinction. 

2. Drive in pe- 
riodic recondition- 
ing. Our second ex- 
ample of how the 
motivation concept 
derives from the 
effect of an opera- 
tion like food-depri- 
vation, is taken from 
periodic recondi- 
tioning. An animal 
placed on a schedule 
of P-R at a fixed 
interval of, say, three 
minutes, will give 
cumulative response 
curves of varying 
slopes. That is, the 
rate of responding, 
60 within wide limits, 
increases with in- 
TIME IN MINUTES creased deprivation 

FIG. 62. Cumulative response curves for ( see Figure 62). Sim- 
one rat under P-R on six successive days in ilar results are ob- 
which no food was given beyond the pellet tained if, instead of 

reinforcements during the sessions. The in- _, ^ * . 

creasing drive is reflected in the rising slopes var y m g deprivation 
of the curves. (From Skinner, 1938.) time, we deprive all 
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animals at the same time but provide them with different 
amounts of food just before using them in a P-R session (see 
Figure 63). 

This introduction to the nature of drive needs rounding 
out, and we shall attempt to do this in succeeding sections. 
Before going on, however, we should mention two miscon- 
ceptions of drive which our analysis enables us to sidestep. 
These are the twin errors of identifying the drive exclusively 
with either the reflexes or the operations involved. Taking 
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FIG. 63. Cumulative response curves for the same rat under P-R with 
varying amounts of pre-feeding. The curves were taken on different days. 
The rat was 24-hours hungry each time, but was pre-fed with the amount 
in grams indicated on each curve. (From Skinner, 1938.) 

hunger again as our example, it would be dangerous to say 
that, merely because an animal eats, he is hungry. There are 
many variables which may influence eating behavior. Emo- 
tional excitement can make an animal stop eating, or the 
introduction of another animal into the situation may cause 
him to resume eating after he has stopped; and human beings 
may eat for social reasons or starve for political ones. Mere 
eating at any given time is an inadequate criterion of hunger 
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unless we know what preceded the eating. On the other hand, 
identifying a drive with its establishing operation, such as 
deprivation, is also wrong. There are many deprivations 
which are of no consequence in determining behavior (for 
example, depriving a rat of organ music) which do not estab- 
lish a drive. The truth of the matter is that when we notice 
changes in reflex strength we look for corresponding opera- 
tions, the two things together leading us to infer the drive. 
It is because everyday observation gives us prior knowledge 
that food deprivation has an effect on response that we accept 
it alone as 'hunger*. We would gain little from argument 
as to whether a starving mystic, in the throes of his elation, is 
'hungry' or not! And phrases like 'a twenty-four-hour hungry 
rat* or 'a drive level of sixteen hours/ commonly used as a 
concession to convenience, should always be understood to 
mean 'twenty-four or sixteen hours of deprivation.' 

Drive as an "Internal State" 

It would be cumbersome in talking and thinking about 
drive to have always to do so in long and meticulous state- 
ments about its nature as 'a change in reflex strength attrib- 
utable to some operation performed either experimentally 
or naturally . . .' and so on. The word drive is a shorthand 
device, a convenience of expression, for both the concept of 
motivation and the facts subsumed thereunder. We do no 
harm in using it as long as we know that it is an ultimately 
dispensable word, and that we can return to our analysis of 
it when necessary. It is like the word force in physics where, 
in the end, it must be refined and analyzed, but where it is 
a handy word in many connections. The danger associated 
with the psychologist's drive and the physicist's force is that 
of letting oneself be misled into the false issues that are raised 
by vague everyday references. 

Similar considerations apply to the designation of drive as 
an 'inferred internal state.' This phrase simply recognizes 
that the effects of an operation like food deprivation are ram- 
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ified throughout both the physiology and behavior of the 
Drganism. Physiologically, there are numberless reverbera- 
tions and correlates of the operation. These, in a scientific 
division of labor, are within the province of the physiologist, 
the chemist, the biophysicist. The concept of drive, however, 
is a behavioral one; and the physiological effects are not the 
drive. Behaviorally, a drive is marked by concurrent changes 
in the strength of many reflexes. A food-deprived animal not 
only presses a bar more frequently and eats more vigorously, 
but many other reflexes change at the same time (e.g., climb- 
ing, sniffing, and running about) if the bar is not present, 
giving us the impression of increased general activity and 
restlessness. This makes it appear that the operation has 
changed the creature in some over-all way, and makes the 
notion of a 'state of the organism' more palatable. 

Very few persons would question that the state is 'infer- 
rable* from the observations, or that it must be 'internal* 
in nature; and they would probably approve a formula such 
as this: 



Observable 
ist Term or Event 

OPERATION 

(e.g., food 
deprivation) 



Construct or 
Intervening Term 



(e.g., of hunger) 



Observable 
and Term or Event 

REFLEX CHANGES 

(e.g., strengthening 
of bar-pressing, 
weakening of grooming 
behavior, etc.) 

Yet, despite the validity of the observations, the phrase 'in- 
ferred internal state* adds nothing to our knowledge of 
drive, because it denotes nothing beyond that which is con- 
tained within the observations themselves. It is, once again, 
a convenience of expression, and we might dispense with the 
term altogether if it were not for the effort involved in strain- 
ing for technical purity. 

We have given much space to the clarification of the nature 
of drives, and the difficulties which come from popular us- 
ages. If our points are kept in mind, it will be unnecessary for 
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us to forego the advantages of ease and succinctness which 
inhere in our ready-made and ingrained vocabulary. Instead 
of devising new symbols, we can adopt familiar speech if at 
the same time we remember that we have an agreed-upon 
analysis to fall back on whenever it is needed to reason out 
new motivational problems or re-examine old ones. We shall, 
then, henceforth use expressions like 'establishing a drive/ 
'reducing a drive/ and others, because they are neat, but in 
every instance our abbreviated phrase can be translated into 
the more correct and expanded form that our discussion has 
supplied. 

Discovering Drives 

One of the commonest questions asked of students of 
motivation is How many drives are there? To this there is no 
answer but a tentative one. The number of drives which 
exist for any species of organism is not determined by any 
man's say-so, and no list can be given except in so far as the 
drives are known today. One who wishes to draw up a list 
must be prepared to defend it by citing his evidence for each 
one that he includes, since a drive is not a matter of opinion 
but of proof. He must be able to demonstrate that, as a con- 
sequence of some operation, reinforcement and conditioning 
are made possible; that the operation leads to changes in 
strength of a previously conditioned reflex and, concurrently, 
of others; and so on. And, later, if new evidence is forthcom- 
ing to show that a drive has been unwittingly omitted, he 
must be ready to accept it and add it to his list. 

The fact that evidence must be found for a drive means, of 
course, that drives are discovered. There is no way of telling 
in advance how many will be finally found, and all attempts 
at enumeration are provisional. The situation is not unlike 
that surrounding the discovery of chemical elements before 
Mendeleev's periodic table, or since modern atomic research 
showed how to create new substances. At neither time could 
the number of obtainable elements be predicted with cer- 
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tainty. The explorer into drives faces two problems. On the 
one hand, he may set out to discover drive factors in behavior 
which were not hitherto suspected. An example is that of the 
so-called "sub-hungers" which are established, not by gross 
food deprivation, but by particular deficiencies in diet, such 
as lack of calcium. Sub-hungers act on reflex strength to direct 
an animal's choice of foods (his selective eating) from among 
many that are simultaneously proffered. These sub-types of 
hunger were unearthed in recent years after chemistry and 
dietetics gave inspiration to the idea and made possible the 
appropriate depriving operations. On the other hand, the 
explorer may guess that a drive influence is at work but may 
not have the establishing operation at his command. The 
operations for hunger and thirst have been known to all men 
throughout the ages, but this is not true of such a well-rec- 
ognized drive as sex. Although, in the females of many 
species, this drive has long been known to undergo cyclic 
changes as a function of time, it has only recently been pos- 
sible to exercise regulation through glandular extracts, surg- 
ical incursion into sex tissues, and so on. The discovery, 
classification, measurement, and study of any drive are in- 
extricably related to the identification of (and, hopefully, 
mastery over) its establishing operations. 

In addition to their incompleteness, lists of drives would 
be different for different types of living organisms. An opera- 
tion which strongly affects the behavior of one species may 
have little or no import for another. Broadly viewed, the 
amount of similarity in motivation from one species to an- 
other depends upon their evolutionary closeness, but this 
does not alter the need for separate listings and proofs. For- 
tunately, we are not here concerned with cataloguing, but 
with an introduction to the nature of drives, to some impor- 
tant ones that human beings share with other organisms, and 
to the problems of experimental investigation. It is to these 
ends that our later discussion of a few representative drives 
is mainly directed. 
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The Measurement of Drives 

If you have followed the preceding points, you should have 
no difficulty with the next. The matter of measuring drives 
(or drive level, or drive strength) is always approached incor- 
rectly by one who thinks of a motive as a substantive thing 
rather than a set of relations between an establishing opera- 
tion and behavioral changes. You can readily see that what 
we measure is behavior; that, in practice, any method pur- 
porting to measure drive is one that measures reflex strength 
as it changes with different degrees of a selected operation. 
The establishing operation is our independent variable, the 
behavior our dependent variable; the former is specifiable as 
to kind and degree, the latter is measured for extent of 
change. The concomitant variation of the two gives rise to, 
and defines, the concept and the problem of motivation. 

It follows, therefore, that there are as many ways of getting 
at any drive as there are behavioral effects that can be meas- 
ured. There is no single consequence of an operation like, 
say, food deprivation, that must be depended on; any con- 
comitant change in response may be taken. In studying or 
discovering drives, it is important to find an appropriate re- 
sponse that undergoes clear enough, and large enough, 
changes to yield significant co-variation with the operation. 
A poor choice can make it seem that the operation has had 
no effect on behavior (is not really drive-establishing), but 
trial with another reflex may give immediate success. 

It would be superfluous, even if it were possible, to catalog 
all the reflexes that have been measured in studies of motiva- 
tion. The particular one selected by an investigator is only 
incidental to his purpose of learning something about drive. 
Fortunately, the number of reflex properties and character- 
istics that can be measured is not so great as to make burden- 
some our task of selection. Here we find, as you would expect, 
such familiar aspects of responding as rate, latency, force, and 
the like, all of which are old friends in the analysis of be' 
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havior and require no further treatment here. In our later 
discussion of several representative drives, we shall see with 
greater clarity and in more detail how these measures are 
employed. 

Two Classes of Drives 

As we have pointed out before, many historical observa- 
tions of human and animal behavior have contained elements 
of great accuracy and faithfulness to nature. The age-old 
division of motives into appetites and aversions is based upon 
such observations, and remains useful to modern scientists. 

The twofold classification of drives arises from certain 
natural differences, (i) The actual operations which establish 
drives may be those of deprivation (for example, of food or 
water) or stimulation (as by electric shock or painfully strong 
lights or noises). (2) The types of reinforcement which are 
effective after, or appropriate to, these two operations also 
differ. (3) Appetites can be reduced or satiated, whereas aver 
sions cannot. Thus, given sufficient food, hunger is erased; 
but an animal that is aversive to electric shock or strong light 
cannot be satiated with no-shock or darkness. (4) Most dep- 
rivations must extend over some length of time before their 
drive effects become evident. The build-up may require 
hours, days, or weeks (think of thirst, sex, and the sub- 
hungers). By contrast, aversive stimuli act, for all practical 
purposes, immediately. No sooner are they applied than they 
result in widespread reflex changes and set up the conditions 
for the strengthening of responses that remove them. 

With respect to aversions, it is likely that any stimulus, 
sufficiently intense, may be drive-inducing. The intensity of 
stimuli forms a physical continuum going from zero to ex- 
treme magnitudes. At low but supra-liminal values they may 
serve an elicitative or discriminative function while remain- 
ing motivationally neutral, in that the animal will neither 
'work for them* nor 'seek to get rid of them/ At moderate to 
high values, they may take on an aversive quality, and re- 
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spouses may be strengthened by their removal. Extreme 
intensities of stimuli may, through their 'emotional* effect, 
be no longer suitable for discriminative or aversive purposes. 
(For further discussion, see Chapter 10.) At any rate, an 
attempt to list the known drives should probably, therefore, 
include under aversions a reference to all the stimuli to 
which an organism is responsive, with the understanding 
that this holds for a given, but as yet indeterminate, range of 
intensity values. 

Table X 

RELATION OF DRIVE-OPERATIONS TO REINFORCEMENT AND SUBSEQUENT 

BEHAVIOR 



Operation \ 

or | -*Drive 
Natural event) 



Contingency 

(What the response 

accomplishes) 



Effect of contingency 
on response strength 



deprivation-* Appetite 



a. obtaining the sub- 
tance (e.g. food): 
"positive reinforce- 
ment." 

6. not obtaining the 
substance (e.g. 
food): 

"withholding posi- 
tive reinforce- 
ment." 



a. Raises strength. 



b. Lowers strength 
(or fails to condi- 
tion), plus possible 
emotional effects. 



Stimulation Aversion 



a. removing stimulus 

(e.g. shock): 
"positive reinforce- 
ment." 

b. Not removing 
stimulus: 

"withholding posi- 
tive reinforce- 
ment." 



a. Raises strength. 



b. Lowers strength 
(or fails to condi- 
tion), plus de- 
pressed or emo- 
tional behavior. 
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Some Misconceptions about Drive 

The nodern conception of motives has taken firm shape 
only it the last few decades. Before that, motivation was dis- 
cussed -rom many standpoints, and, since the available facts 
were few, it is not surprising that many different conclusions 
were drawn. Our heritage from those years, as is so frequently 
the case in the history of science, is a mixture of valuable 
observations and now-discarded theories. We willingly accept 
the former, but hasten to correct the latter when they creep 
into modern thinking. For this reason we must pause here to 
indicate some outmoded notions of drive which seem espe- 
cially productive of confusion in a present-day context. 

i. A drive is not a stimulus. Historically, a stimulus was 
treated as a goad (stimulus is the Latin word for goad) and 
was often confused with motivation (motive means "move- 
ment-initiating"). These are, of course, only manners of 
speaking and are of little factual or theoretical value. A drive 
has neither the status, nor the functions, nor the place in a 
reflex, that a stimulus has. It is not a part, or a change in a 
part, of the environment; it is not, in itself, either eliciting, 
reinforcing, or discriminative; and it is not correlated with a 
single response (as is the stimulus) to give us our behavioral 
unit, the reflex. True, the distinction between state and 
stimulus is hard to maintain when we consider the aversions, 
which are stimulus-produced states, but we must not confuse 
the effect of the stimulus with the stimulus itself. We would 
say that the aversive stimulus sets up a drive, one effect of 
which is to change the momentary strength of a group of 
reflexes and another to make reinforcement possible. But to 
say that a stimulus excites one or more stimulus-response rela- 
tions directly is to deal in absurdity, leading us to such a 
paradigm as this: S (S R). 

Another difficulty in keeping drives and stimuli distinct lies 
in the fact that there are internal stimuli which accompany 
drives and may serve as S D 's for a response. Thus, an operation 
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like food-deprivation which establishes the hunger drive also 
gives rise to internal stimuli which may, in turn, occasion a 
response of one sort or anotherperhaps the verbalization 
I'm hungry. There is no cause to equate two things merely 
because they co-exist, when actually they have different prop- 
erties and play different roles in the control of behavior. A 
drive is not identifiable with the stimuli it may itself evoke. 

2. A drive is not a response. Although a motive involves 
behavioral changes, it is not in itself a response. An organism 
does not 'respond' (except in a figurative way) to an aversive 
stimulus or an appetitive deprivation by developing a drive 
state. Drive is established by these operations, but is not a 
response to them in any proper sense of the word. To put it 
in another way, a motive is neither a smooth-muscle or gland- 
ular respondent nor a skeletal-muscle operant. 

3. A drive is not a physiological concept. Everyone ac- 
knowledges that behavior is the behavior of an organism, and 
is consequently accompanied by physiological, chemical, 
mechanical, electrical, atomic, and other processes in that 
organism. One must also acknowledge that drive-establishing 
operations have similar broad consequences. But these facts 
do not compromise the status of drive as a behavioral concept. 
The concept arises from behavioral data, and it is in a science 
of behavior that it is required. The physiologist has no need 
for drive in his account of, say, blood changes during hunger. 
He correlates these changes with food-deprivation, just as we 
do reflex changes; and his correlations are physiological prin- 
ciples, while ours are behavioral ones. The laws of behavior 
are not the laws of physiology, neurology, chemistry, or phys- 
ics. Yet, one disadvantage of treating drive, even fictitiously, 
as a 'state,' is the tendency it fosters in the beginning student 
to think of it as something physiological which intervenes 
between the establishing operation and the reflex changes. 
This habit of thought is readily acquired by one who believes 
that behavior 'cannot really be understood' without reference 
to 'underlying bodily processes.' In the end, however, the 
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'understanding* of behavior depends upon finding lawfulness 
in behavior and, once found, it should not be ignored or re- 
nounced in favor of lawfulness in physiology or any other 
subject matter. 

4, A drive is not pleasure-directed. Motives involve neither 
the purposive nor the pleasure principle commonly discussed 
by the philosopher. When one says that an organism 'wants to 
obtain pleasure' or 'works to get the satisfying reward/ he 
uses a manner of speech which dimly expresses the action of 
reinforcement in strengthening a response. We have noted 
(page 255) that pleasant is a human verbal response that may 
be attached to certain S D 's and is related to the appropriate- 
ness of a reinforcer to the prevailing drive. Hedonic philos- 
ophies do not stick close to the facts of behavior in ascribing 
an objective existence to pleasure, and stressing the procure- 
ment of pleasure per se as a motive or the purpose of all 
motives. We do not deprive an organism of pleasure, but of 
food; we do not reduce hunger with pleasure, but with food, 
and the purpose of the organism is irrelevant to either dep- 
rivation or satiation. 



SOME REPRESENTATIVE DRIVES: 
ACTIVITY DRIVE 

Every response, whether operant or respondent, is an activ- 
ity and, in this sense, every psychological experiment deals 
with activity. But the terms activity drive and general activity 
have a different and definite meaning as they are applied to 
the motivational and behavioral characteristics of an organ- 
ism. Activity drive is classed among the appetites, and is the 
source of action when other reasons for behaving are ruled 
out or controlled. In nature, periods of activity are preceded 
by periods of inactivity during which the need increases, just 
as hunger drive appears in an alternation of eating and not- 
eating. The fundamental operation, then, is deprivation, and 
activity drive can be controlled in the laboratory by experi- 
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mentally varying the deprivation time (i.e., by imposing 
variable periods of inactivity). 

The drive is manifested in general activity, by which is 
meant the total over-all movement, or getting-around, of the 
animal. As we shall see, it is not necessary in measuring gen- 
eral activity to dissect it into its component specific responses 
like running, climbing, scratching, and so on. It may be 
recorded as though it were a single thing, although actually 
it is the gross sum of all the reflexes which have been increased 
in emission rate because of the drive operation. It is possible, 
however, and desirable in many connections, to use one com- 
ponent of activity as a measure of the whole, because the rise 
in strength of the component is a reflection of the general 
increase. General activity is influenced by other motives than 
activity drive (all drives produce concurrent changes in the 
strength of many reflexes), and it may then serve as an indica- 
tor of the presence of these others. It is only when these others 
are not present that one speaks of a genuine activity drive. 

An Illustrative Series of Experiments 

In 1922, Richter introduced a new method for recording 
the general activity of the rat. This made use of a triangular 
cage mounted on rubber tambours which were connected by 
air tubes to a kymograph recorder. As the animal moved 
about the cage, the shifting weight on the tambours affected 
the air pressure in the system, producing greater or less agita- 
tion of the kymograph pen depending upon the amount of 
movement, thus giving a measure of total activity. 

In one experiment, with other drives and external stimuli 
controlled, evidence for the activity drive was sought, and 
found, in continuous twenty-four-hour records of the activity 
occurring "spontaneously" with the mere passage of time. 
Active periods were seen to appear (ten to fifteen times per 
day) with good regularity of spacing and duration. This de- 
pended somewhat upon the age of the animal, both young 
and old animals being less active than those in their prime. 
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Although there were individual differences among animals at 
every age, each one revealed typical bursts of activity. The 
demonstration of pure activity drive has been paralleled with 
other organisms than the rat, and the drive has turned out to 
be a factor that has to be reckoned with in a great variety of 
psychological experiments. 




FIG. 64. Richter's tambour-mounted activity apparatus. 
(After Richter, 1922.) 

With the same apparatus, Richter went on to study the 
relation of various drive-establishing operations to general 
activity. He found that his method, although lumping to- 
gether all changes in reflex strength into the single measure 
of activity, was sensitive enough to show up the effects of the 
different operations. Thus, in an experiment with hunger, a 
group of rats was put on a twenty-four-hour eating rhythm. 
Each animal was allowed to eat to satiation from an unlimited 
supply once a day at the same hour, fresh water being always 
present. Half of the group was then deprived of both food and 
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water, while the other half was deprived of food alone, and 
the deprivations were continued without break until all the 
animals were dead. Each rat lived in his own tambour* 
mounted cage for the duration of the experiment, and con- 
tinuous twenty-four-hour activity records were taken. The 
results showed several things. In the first stage of the experi- 
ment, the twenty-four-hour rhythmic hunger sent activity 
above the usual level for satiated animals. Moreover, the tem- 
poral pattern of the activity over the day took on a character- 
istic form, there being a rise about twelve hours after a meal 
and another as the next mealtime approached. When the con- 
tinuous deprivation was started, the rats without food and 
water showed regular decreases in activity until the fifth day 
when all activity ceased (shortly before death), whereas those 
with food-deprivation only increased their activity for the first 
few days and then dropped off to zero (eighth day). The addi- 
tion of thirst to hunger, then, gave a different result from a 
mere increase in the effect of hunger alone. 

Richter also tested the effect of temperature on activity. 
Three temperature levels were used: 10-15, 2 3> an( ^ 2 9'3 
Centigrade. The rats were on a twenty-four-hour eating 
rhythm, with all other conditions controlled. He found less 
activity at either extreme than at 23 Centigrade. Experi- 
ments by other men, and with other organisms, support the 
general finding that a temperature drive, certainly among 
warm-blooded animals, is as valid a drive as any. Mass migra- 
tions, and the perennial flight of city dwellers to the seaside 
during a heat wave, bear witness to the power of a few milli- 
meters of mercury in a thermometer column. 

In still another experiment, Richter studied the 'nocturnal- 
ity' of rats, that is, the effect of illumination on activity. The 
animals spent alternate twelve-hour periods in the light and in 
the dark, these periods being controlled by the experimenter 
without reference to the natural day-night cycle. Before re- 
cording was started, they were put on a twenty-four-hour 
eating rhythm. To insure no contamination of the records by 
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activity related to feeding time, half the rats were fed just 
before the twelve-hour light period and half before the dark. 
Activity was found to be higher in the dark than in the light, 
with the difference increasing for older animals. This result 
has been confirmed by other investigators (Hunt and Schlos- 
berg, 1939) and is related to the depressive effect of light upon 
bar-pressing (Skinner, 1938), as well as the study of the light- 
aversion drive to which we shall return presently. 




FIG. 65. An activity wheel. 

In addition to the tambour-mounted cage, Richter em- 
ployed a laboratory device which had been in use for some 
time before his work, the activity wheel. This apparatus is an 
enclosed wheel, suspended by its axle so that it rotates easily, 
the arrangement being that of a circular tread-mill. From a 
living cage set by its side, the rat can enter the wheel when- 
ever the door is opened, and run as much and as fast as he 
wishes, with the wheel turning faster as he runs faster. Each 
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rotation is automatically tallied on a counter, and readings 
can be taken for any period without disturbing the animal. 
The wheel does not provide an index of general activity in 
the same way as the tambour-mounted cage, since only 
straight-away running is counted and neither lateral move- 
ments within the wheel nor partial revolutions are recorded. 
Nevertheless, data obtained with this device are highly cor- 
related with those from recorders like Richter's, and can 
therefore be taken as an equivalent measure. Using the wheel, 
Richter and other workers have confirmed and supplemented 
the earlier findings on pure activity drive and the effects upon 
activity of hunger, age, and other factors. Of especial interest 
is the finding that activity varies as a function of deprivation. 
If a rat is confined to a small cage and denied access to the 
wheel except for a short daily period, he will do more running 
in the time allowed than he would do otherwise; the drive 
apparently works like any other appetite in this respect. 

Other Illustrations of Activity Drive 

There are many indications of activity drive in non-experi- 
mental situations which point to its reality. Caged animals in 
zoos have periods of activity which visitors try to catch. Lions 
and tigers pace, elephants sway and 'dance/ seals swim and 
dive, monkeys climb and chase although other drives may be 
taken care of and the activity may seem 'unnecessary/ City 
dogs, ordinarily in leash, jump and run and roll on the 
ground when freed in a park or meadow, not unlike children 
let loose from school or spending the first hours of a country 
vacation. Some observers have even reported that human in- 
fants, overly restricted in movement, 'work off' their activity 
in rhythmic swayings, rockings, and jouncings within their 
play-pens or cribs. Such instances seem reasonably clear in 
terms of activity drive and activity deprivation. 
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Activity and the O per ant Level 

It was noted earlier (page 76) that the operant level of 
any representative reflex is related to two things: the general 
activity of the organism, and the ease of conditioning that 
response. The introduction of a drive like hunger is relevant 
to both of these. Not only does it put into our hands the power 
of reinforcement, hence of conditioning, but it also expedites 
the conditioning by raising the general activity. This rise will, 
if we have properly chosen our reflex, carry along with it the 
operant level, so that the response becomes more available 
for reinforcement and our chances of reinforcing and con- 
ditioning it quickly are enhanced. 

Other factors, of course, may enter in setting the operant 
level. There are species differences (think of the relative activ- 
ity of a turtle and a mouse); there are age differences within 
any one species; and you can probably think of half a dozen 
additional determiners. Species differences are especially im- 
portant when it comes to the form of the response to be 
conditioned. Pigeons can be taught to peck at targets, rats to 
press bars, monkeys to turn keys or lift latches, and so forth. 
The response, naturally, must be within the organism's 
capacity. Usually we select one that is easy, or natural, for 
the species in question. If one decides to use a response which, 
though possible for the organism but having an almost-zero 
operant level, conditioning will be slow and may even require 
a gradual differentiation. 

HUNGER DRIVE 

The beginning of scientific research on hunger is of inter- 
est, among other reasons, because it shows how, in the gradual 
refinement of a concept, different paths may be taken by 
different people; how, by experiment and hypothesis, explora- 
tion and reasoning, proof and counter-proof, there is finally 
left a residue of agreed-upon facts and conclusions which are 
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incorporated within the body o scientific theory. In using 
hunger as our model in the preceding pages, we have already 
said a good deal about this drive. We may, however, tell a bit 
of the early history of hunger research as a setting for some 
further facts. 

Richter's work on activity and hunger drive, together with 
information already at hand, contained within it the essence 
of a proper formulation of hunger and, indeed, of all drives. 
He had performed a deprivation; he had observed concurrent 
changes in the strength of many reflexes ('heightened activ- 
ity'); and he had two techniques for measuring these changes 
(the tambour-mounted cage and the running wheel). More- 
over, his techniques were able to measure the degree of the 
drive, that is, they showed the co-variation of amount of 
response change with amount of deprivation. With this 
ground-work laid, the concept of drive was ready to be de- 
veloped. At this point, however, he and other workers were 
diverted by a strong contemporary interest in the physiolog- 
ical correlates of hunger, and the appropriate behavioral 
concept of motives was side-tracked for several years. The 
physiological studies were centered upon events in the 
stomach resulting from food deprivation, and it was to these 
that attempts were made to relate the activity changes in 
hunger. 

Hunger and the Stomach 

In 1912, a decade or more before Richter began his work, 
W. B. Cannon, in collaboration with A. L. Washburn, pub- 
lished a paper entitled An Explanation of Hunger. Tracing 
back some sixty-six years the idea that "hunger is the result of 
(stomach) contractions," they felt that they were able to offer 
direct proof of the correctness of this view. Their data were 
obtained by means of a simple apparatus consisting of a small 
rubber balloon which was swallowed by the subject (Wash- 
burn, principally) after his stomach was emptied of food. To 
this balloon was attached a thin rubber tube that led up the 
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oesophagus and out the subject's mouth. The balloon was 
then inflated until it assumed the contours of the stomach. 
With the tube connected to a water manometer, any pressure 
upon the balloon could be observed and recorded by a floating 
marker on the water. After some practice, the subjects wers 
able to retain the balloon and tube without nausea, and 
experimentation was begun. 

The findings may be briefly summarized. A few hours after 
eating, when the stomach is empty, powerful stomach con- 
tractions begin. These 
contractions are of 
about thirty seconds' 
duration, and they oc- 
cur at thirty- to ninety* 
second intervals. They 
may go on, in this 
rhythmical fashion, for 
thirty minutes or more, 
to be followed by quies- 
cent periods of thirty 
FIG. 66. Portion of a record of human to ninety minutes, 
stomach contractions during hunger. The _ r , , , . 

white signals on the bottom line are the When the sub J ect > un ' 
subject's reports of "felt" hunger. The re- aware of the record his 
ports coincide with peak contractions. The stomach was writing 

^^cS^gST 1 " '" obeyed the experiment: 

er's instructions and 

signalled (by depressing a telegraph key) whenever he 'felt 
hunger pangs/ his signals were found to come when "contrac- 
tions were invariably being registered" and near the time 
when their force was maximal. 

From their data, Cannon and Washburn came to the con- 
clusion that hunger, which they described as "a dull ache or 
gnawing sensation referred to the lower mid-chest region and 
epigastrium/* was due to the stimulation provided by the 
strong contractions of the empty stomach. So great was the 
impression made by their research, that this stimulus theory 
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of hunger prevailed for many years. We know, today, that 
there is more to hunger than stomach contractions. It has 
been shown, for example, that food-getting activity continues 
with undiminished vigor when the stomachs of rats have been 
surgically isolated to the degree that stomach contractions 
could not possibly serve as stimuli for any responses (Bash, 
1939); that animals will eat prior to the appearance of the 
contractions and long after these contractions have disap- 
peared; and that changes in response rate as a function of 
deprivation are inconsistent with any known properties of 
stimuli, in the stomach or elsewhere. Yet the stimulus theory 
of hunger persisted, and so strong was its influence that theor- 
ists were led to postulate a stimulus nature for all other drives, 
even those which, like sex and the sub-hungers, should have 
encouraged a reconsideration of the evidence and led to a 
truer conception. 

Before Richter's time, there were two major obstacles in 
the way of a correct interpretation of drive. One was the Pav- 
lovian notion of stimuli as eliciting agents. In looking for the 
cause of behavioral changes associated with drive, the only 
known possibility was the elicitative action of stimuli. The 
recognition of operant behavior still lay ahead. The second 
obstacle was that the subject, as in the Cannon- Washburn 
study, could report the 'experience* of hunger pangs. They, 
and later workers, took the verbal responses and signals, not 
as behavioral data to be explained, but as objective indicators 
of the drive's presence and action. Today we know that a 
drive-establishing operation has among its consequences the 
creation of internal stimuli which can act as S D 's for operant 
responding. Stomach contractions are stimuli of this sort. 
They are one bodily result of fasting, and they are certainly 
discriminable by the organism, as Cannon and Washburn dis- 
covered when their signals 'I am feeling a hunger pang* were 
seen to be occasioned by the contractions. But, while the con- 
tractions are S D 's for the verbalization or the report of a 'sensa- 
tion* or 'experience' of hunger, they are not the drive itself. 
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The 'feeling' of hunger is not hunger, or, to say it differently, 
hunger is not in the feeling of it. 

The Measurement of Hunger 

As with any drive, the measurement of hunger is made 
possible by any response change which co-varies with the 
degree of the drive-establishing operation amount of dep- 
rivation. The general activity method gives the needed rela- 
tion between deprivation and response change. In a crude 
way, it provides a measure of response rate, since with it one 
obtains the total of response emissions over a rather broad 
period of time. Both these things are true of the second tech- 
nique, the activity wheel. The obstruction method is one in 




FIG. 67. Floor plan of the Columbia Obstruction Box. The labeled 
parts are: A starting chamber; B passage-way with grid floor; C and 
> reinforcement chambers. (After Warden, 1931.) 

which the response must be made in the face of a deterring or 
opposing 'resistance' like electric shock. That is, the animal 
must submit to a negative reinforcement before he can pro- 
ceed to the positive. The animal is acclimatized in a box, with 
two chambers, and allowed first to run from one to the other 
across an uncharged grid to a reinforcement, such as food. 
Then the shock is turned on, and a test period of, say, twenty 
minutes is given to see how many times he will take the shock 
to reach the food. After each crossing, he is allowed a nibble 
and then quickly transported back to the starting chamber 
for another trial. This method has features which complicate 
the data got from it. Principally, one must remember that it 



MOTIVATION 289 

depends upon a conflictful situation in which positive and 
negative reinforcement are combined and in which two 
drives, an aversive and an appetitive, are always operating 
together. 

A more recent, and very satisfactory, method employs re- 
sponse rate under P-R. Co-variation between deprivation and 
response is shown in the emission rate of the response, which 
is a fundamental indicator of operant strength. Figures 62 and 
63 (pages 267 and 268) have already illustrated this method as 
it was first used by its originator with the bar-pressing re- 
sponse. If you will look back at these graphs, you will see that 
the changing slope of a P-R curve is admirably suited to 
measuring degrees of hunger. The measure is uncomplicated 
by a simultaneous aversive drive; it is sensitive to small 
changes in drive level; it is applicable to a single reflex rather 
than a mass of reflexes; the emission pace is not restricted by 
arbitrary trial imposed on the organism by the experimenter; 
it is useful with a single animal, and does not require the 
averaging of data for many; and, finally, it yields a measure of 
reflex strength at any desired moment, instead of a gross one 
for a long test period which may run to a full day. Such a 
procedure is superior to the use of eating curves (which may 
have suggested itself to you) because, in the latter, the rate of 
responding depends so much upon the rate at which the 
animal can eat (see Figure 7, page 43). Above a certain 
point of deprivation, the animal eats as fast as he can anyhow, 
with the result that the slope of an eating curve is a relatively 
insensitive measure of drive strength. P-R curves depend less 
upon eating time, and their slopes are freer to vary. 

Hunger Drive in Man 

As is true of so many behavioral problems, direct experi- 
mentation upon human subjects is very difficult to arrange 
in the case of hunger. Social pressures militate against the use 
of infants, and adults are poor material because of their long 
and unknown personal histories of conditioning and motiva- 
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tion. Yet, there is no reason to believe that hunger is any less 
important for man than for lower organisms. History records 
huge migrations in search of food, and wars over bountiful 
lands; there are reports of human cannibalism under extreme 
starvation despite all previous training; and there is, of 
course, Napoleon's classic and highly credible statement that 
"an army travels on its stomach." The psychologist, Wada 
(1922), who studied the relation of hunger contractions to 
activity in infants, felt that hunger was initially the strongest 
of all drives, since the child when not hungry is usually asleep. 

Several studies have shown that, among the human re- 
sponses which a drive like hunger can affect, we must include 
the verbal. Discriminative stimuli normally insufficient to 
evoke food-word responses may become increasingly effective 
under stronger degrees of hunger. Thus, Sanford (1936) 
sought to find the relation of hunger to the number of food 
words (names of foods, meals, etc.) emitted by a group of 
college students. Tests of word association, word completion, 
and the like, were given (without the subjects' knowledge of 
their true purpose) at various between-meal periods and after 
a twenty-four-hour fast. Results indicated an increase in food 
words as a function of increased time since eating. In a similar 
type of experiment (Levine, Chein, and Murphy, 1942), 
volunteer adults underwent food deprivations of one, three, 
six, and nine hours, after which they were shown various 
ambiguous or 'nonsense* figures briefly behind a ground-glass 
screen and asked to name the 'objects pictured/ The per- 
centage of the times that foods were 'seen* was greater for the 
hungrier subjects. 

Such results remind one of the great amount of sexual con- 
versation heard among the members of isolated and sex- 
deprived groups soldiers, convicts, and others. Also in line 
with this, it is well-known that extreme degrees of drive may 
precipitate 'illusions' wherein very weak or usually non- 
generalizing stimuli may be responded to in a manner that 
seems pathological to the unmotivated onlooker. A point may 
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be reached, in fact, where drive is so strong that no external 
S D at all may be required for the response to appear: a starving 
man may 'see' his favorite dishes before him. 

THE SUB-HUNGERS 

The possibility of demonstrating sub-hungers depends on 
the fact that the withholding of food need not be an all-or- 
none matter, but can be limited to particular constituents. In 
place of total deprivation, an animal is offered all he can eat 
of a diet which is unbalanced or deficient in some respect. 
The question is whether such operations (selective depriva- 
tions or selective satiations) have effects on subsequent re- 
sponses which warrant our speaking of various sub-hunger 
drives. 

Testing for Sub-Hungers 

There are several ways in which sub-hungers have been 
demonstrated. In the satiation method, as used by Bousfield 
(1938), the animal is first allowed to eat until sated with one 
food, after which that food is removed and one or more others 
are offered one at a time. When cats were used as subjects and 
four kinds of food were rotated in order of presentation on 
successive days, he found that satiation with one food might 
lead to no further eating when the others were presented, 
whereas satiation with another might be followed by succes- 
sive satiations on all those that followed. 

A second procedure is that called the free choice or selective 
eating method. This gives the animal an opportunity to select, 
in any amount he wishes, from among two or more foods 
made available at the same time. The method can be used on 
animals that have been held for some time to a diet deficient 
in some elements, or with animals that, although reared on 
an adequate diet, have created their own selective deprivation 
by virtue of one-sided choices from among foods offered. 
Numerous experiments with a number of animals (pigs, cows, 
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chickens, mice, etc.) agree quite well in leading to the conclu- 
sion that, given a choice, organisms will tend to select the one 
food needed for an optimal diet or will draw in time from 
many sources to maintain normal health and growth. More- 
over, the choice among rations is governed by the changing 
requirements of the organism during maturation, pregnancy, 
illness, and so on. 

Table XI 
DATA FROM ONE CAT IN AN EXPERIMENT USING THE SATIATION METHOD 

IN THE STUDY OF SUB-HUNGERS 

(Data from Bousfield, 1938) 





Number of Grams of Other Food Eaten after Cat 
Was Satiated with the Food at the Left 


Milk 


Milk, 
cooked 
oatmeal, 
raw ground 
beef 


Ground 
fish and 
milk 


Ground 
beef- 
kidney and 
milk 


Milk 





5* 


12 


130 


Milk, cooked oat- 
meal, raw ground 
beef 


o 


_ 


30 





Ground fish and 
milk 


36 


40 


- 


42 


Ground beef-kidney 
and milk 


o 








- 



An experiment, performed by Davis (1928) on three in- 
fants, supports the surmise that the phenomenon of sub- 
hunger is not limited to lower animals. Just after weaning, 
the children were started on a selective eating procedure 
which lasted six months for two of them and an entire year 
for the third. At each mealtime, a number of natural and 
cooked foods, including liquids, were placed on a tray before 
them. They ate with their fingers, or spoons, or more directly, 
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as they wished, choosing freely from among the offerings. Over 
the long course of the experiment, they showed themselves 
to be omnivorous, and their choices were nutritionally broad 
enough to promote health, energy, good sleeping, and some- 
what better than normal weight increases. Occasionally, one 
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FIG. 68. Model of an apparatus used by Young for studying food 
preferences and sub-hungers. A entrance chamber; # choice chamber; 
C 1 and C 2 proffered foods. (From Harlow, in T. G. Andrews, Methods 
of psychology [John Wiley & Sons, Inc., 1948]. By permission.) 

of the children would develop a strong preference for one 
food (e.g., cereal or meat) which would last for days, but 
would then ease off in favor of a wider selection. It was even 
recorded that one infant, who had rickets at the beginning 
of the study, took cod-liver oil until the condition disappeared, 
and then stopped taking any more! 

Some Final Observations 

These findings lend credence to non-experimental reports 
of the way men and animals adjust their eating to their dietary 
needs. Deer travel far to reach licks when they require salt; 
cattle in phosphorus-poor areas will gnaw on bones to supply 
the lack; dogs may eat grass when ill; malnourished children 
are said to eat earth or schoolroom chalk because of the min. 
erals they contain. Within the laboratory, if rats in a colony 
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are not properly fed, cannibalism may appear as a means of 
dietary compensation; and the same reason may lead par- 
turient female rats to devour their new-born litters. 

Although these things seem dramatic at first reading, it 
should be recognized that animals in the wild state, granted 
a varied and full supply of food, are always faced with the 
problem of what they should eat. That they have survived, 
matured, and multiplied points to the biological value of 
sub-hungers in the life of the individual and the species. Con- 
trariwise, it is curious to note how human adults, under the 
direction of social learning and the pressure of custom or 
fashion, may suffer from malnutrition despite an abundance 
of food. He whose menu is dictated by the social acceptability, 
rather than the adequacy, of foods, needs to have his natural 
sub-hungers given a helping hand by the science of dietetics. 

SEX DRIVE 

There is a large amount of scientific information and gen- 
eral misinformation concerning sex more than we can out- 
line (or counteract!) here. We must limit ourselves to some 
introductory facts. 

The sex drive is classed with the appetites because it in- 
volves responses which change in strength as a function of 
deprivation and satiation. With the females of many species, 
the drive is marked by temporal cycles so long as satiation 
(or insemination) is not provided. Seeking, approaching, 
courting, and copulating are broad categories of responses 
affected by sex drive. Changes in the strength of such re- 
sponses are fairly clear in lower organisms, but obscure in 
man owing to long and meticulous social training. As a basic 
motive in animals, sex has been investigated with methods 
similar to those employed with hunger and other drives. 

In our short treatment of sex drive, we shall refer often to 
physiological correlates of the drive. Much work has been 
done by other sciences on sexual mechanisms. But sex drive 
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is still a behavioral matter, and a stimulus or glandular theory 
of sex behavior would be as extraneous here as a stomach- 
contraction theory of hunger. 

Some Classical Studies of Sex Behavior 

A few pioneer studies may be cited, both for the informa- 
tion they supply and as illustrations of the experimental 
approach to the problem. These studies may logically, and 
for historical continuity, be organized around the apparatus 
and methods employed. Two lines of development will suffice 
for our purposes. 

i. Studies with the activity wheel. In 1923, Wang, working 
in the same laboratory with Richter, reported his findings in 
an experiment on the activity of adult female white rats. In 
the wheel, each female showed a highly consistent cyclical rise 
and fall in her activity, with peaks every four to five days and 
declines on intervening days. Wang followed up this observa- 
tion, and proved that the cycle was intimately correlated with 
events in the reproductive organs of the female, events called 
the oestrus rhythm. Activity is greatest at the height of oes- 
trum, at which time there is a flow from the vagina and the 
female is said to be 'in heat.' At this time, too, ovulation 
occurs so that the female is maximally ready for insemination, 
is most receptive to sexual advances by the male (when not in 
oestrum, she is likely to resist copulation and fight off the 
male), and is most stimulating to the male by way of olfactory 
cues given off by the vaginal discharges. The rise to, and 
recession from, the peak of oestrum coincides with increasing 
vaginal discharge (and the cellular content of the flow is a 
continually changing index of oestrus stages), while between 
peaks there is no discharge at all. (An interesting comparison 
may be noted here parenthetically. The rat's oestrus period 
does not correspond functionally to the menses of human 
females. In contrast with the rat, the latter are fertile approxi- 
mately midway between menstrual periods, and infertile at 
menses when the ovum is being discarded in the vaginal flow. 
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Also, for most human females, the susceptibility to fertiliza- 
tion is low just before menses when the uterus is preparing 
for menses, and again just afterward when ovulation may 
not yet have occurred. It is these features of human fertility 
that are the basis of the so-called 'rhythm method' of birth 
control. Some of the higher apes are thought to resemble the 
human female in this respect, but it is not yet clear just where 
in the evolutionary scale, and why, this curious reversal 
occurred.) 
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FIG. 69. The effect of the oestrus cycle upon the activity of the 
female white rat. Notice the regularity with which the peaks of activity 
occur coming at the height of oestrum. (After Wang, 1923.) 

Wang and later workers went on to show a number of 
other correlations. The characteristic activity cycle of the 
adult female rat is missing before puberty, and commences at 
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puberty when oestrum does; it is absent during pregnancy, 
and after parturition (while lactation is going on), two periods 
when the oestrus rhythm is in abeyance; it is restored after 
weaning of the litter, when oestrus rhythm reappears; it can 
be permanently erased, just as oestrum is, by surgical excision 
of the ovaries, an event that also lowers all activity and renders 
it more uniform from day to day. 

The male rat does not exhibit the sex-activity cycle seen in 
the female. Male rats, like human males and others, differ in 
this regard from many species in which males as well as 
females have special mating seasons. Strength changes in all 
sex reflexes are the result of deprivation and satiation. The 
male's sex drive is lowest right after a period of free access to 
a female in heat, but recovers much of its strength within a 
few hours following. 

2. Studies with the obstruction method. The first to devise 
an obstruction apparatus using electric shock for the study of 
drive was Moss (1924). His apparatus was a forerunner of later 
improved models such as the Columbia apparatus constructed 
by Warner and described earlier (page 288). With Moss's ideas 
of drive (namely, that drive is an "impelling force" the 
strength of which could be measured by the amount of 
"opposing resistance" or "repelling force" it would overcome) 
we are no longer concerned. But his device and his findings 
were the instigators of much worthwhile research and are 
still, therefore, of interest. Having undertaken to measure the 
sex drive of the rat by its "willingness" to undergo electric 
shock in order to reach a sex-object, he compared sex and 
seventy-two-hour hunger by permitting rats to choose be- 
tween a compartment containing food and one containing a 
mate. His observations (on too small a number of animals to 
be significant) were that females in heat cross the grid more 
often to reach the male than the male does to reach the 
female; and also that hunger could be a stronger motive than 
sex, since about 80 per cent of the males chose food rather 
than a female. Tsai (1925), using a method of simple choice 
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without any shock, came to a similar conclusion. His male 
rats, only twenty-four-hours hungry, chose food rather than a 
female in about 75 per cent of the chances given. While sex 
has been condemned or extolled as the most powerful of 
drives, this is apparently not true even of infra-human 
animals. 

The strength of sex drive as related to length of sex dep- 
rivation was investigated by Warner (1927) with the Colum- 
bia obstruction apparatus. Male rats under six degrees of 
sexual deprivation were tested for grid crossings to a female 
in oestrum within a standard observation period of twenty 
minutes. The six deprivation periods, timed from the last 
session of unrestricted copulations, were o, 6, 12, 24 hours, 4 
and 28 days. He concluded that maximal drive was reached 
after about twenty-four-hours 1 deprivation, with subsequent 
slight decline to the twenty-eighth day. A breakdown of War- 
ner's data by Leuba (1931), however, raised some doubt about 
the decline after maximum. If the grid crossings are tabulated 
for the four 5-minute parts of the whole go-minute session, the 
crossings after 24 hours and 28 days distribute themselves as 
follows: 

AVERAGE NUMBER OF CROSSINGS FOR GROUPS OF ANIMALS 



Total 



11.6 

The acceleration of crossings by the 28-day group within the 
2o-minute test period, despite its smaller total indicates two 
things: (i) after 28 days, adaptation to the shock builds up 
more slowly; and (2) that a 2o-minute test period is too short 
to have a full indication of drive strength. There is reason to 
believe that sex drive in male rats does not decrease after a 
maximum, but instead that after the first rapid increase there 
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are diminishing increments with prolonged deprivation. It 
may be mentioned incidentally that Warner, too, showed that 
sex can be outweighed by both hunger and thirst as motives. 

The obstruction method has also been used to study 'ma- 
ternal behavior.' In the female white rat, this is a complex of 
many activities. Objectively, it appears as nest-building, nurs- 
ing, retrieving the young or returning to the litter when sepa- 
rated from them, and more. Nissen (1930) found that the 
number of grid crossings a parturient female would make in 
order to reach her litter was maximal just after birth, and 
decreased until the young were of weaning age. Many ob- 
servers have recorded their belief that reinforcement is the 
basis of this reaction: regular suckling by the young relieves 
breast congestion in the mother, and allows normal function* 
ing of the milk glands; the decline in lactation coincides with 
weaning time (about twenty-one days after birth), and return- 
to-litter behavior fades out. Although physiological correlates 
are not known as yet for the other components of maternal 
behavior, it is a fact that they all lose strength up to weaning 
time, after which the female's oestrus rhythm re-emerges. We 
may note in passing that the principles of stimulus generaliza- 
tion and discrimination show themselves in the operant 
reactions that compose the maternal pattern. Females will 
retrieve, and carry back to the nest, not only their own wan- 
dering offspring, but will take young from the litters of other 
mothers if the age-size difference from their own young is not 
too great. When the difference is large, say ten days or more, 
they show some discrimination and will reject the strange 
young. Moreover, females will cross a grid to litters not their 
own, will suckle other young, and, more strikingly, will re- 
trieve even inanimate objects like a piece of wood or a sand- 
bag if it approximates the size of their offspring. 

Because of our social training, we tend to think of 'maternal 
behavior' as tender, loving, protective and care-providing. 
We call women who act otherwise 'unnatural mothers.' But 
the female rat, though she will take many shocks to return to 
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her litter, will also on occasion, if severely upset emotionally 
or subjected to dietary deficiencies, kill or devour her young. 
And if we get outside the confines of our society, we find prac- 
tices like infanticide acceptable under certain circumstances. 
It is always dangerous to mistake our individual ethical ideals 
for either the facts or the potentiality of human and animal 
behavior. A practical program of ethical training must take 
account of fact, but fact pays no heed to our ethical goals. 

Human Sexuality 

Although the sex drive was always accepted as important 
for infra-human organisms, the emphasis upon its significance 
in human living has come about in relatively recent times. At 
times its role among men has been greatly, even grotesquely, 
exaggerated. Controlled studies of sex among human beings 
are naturally difficult. Society's laws, personal reluctance, and 
the researcher's own sympathies stand in the way of experi- 
mentation; and field studies are handicapped by socially 
induced attitudes that block the information-seeker. The 
result is that real evidence on the characteristics and modes of 
human sex behavior and development is very scanty. Yet it is 
just here that interest is greatest, so that speculations and 
theories are put forth in bewildering and distorted profusion. 
Where facts are few and interest is high, the temptation to 
make a guess and to voice an opinion is hard to resist. Only 
in the last few years has there been in progress (Kinsey, Pome- 
roy, and Martin, 1948) the first large-scale attempt to get 
some elementary statistics on the sex activities of typical 
Americans of all ages, social levels, educational backgrounds, 
and of both sexes. This study, it is estimated, will take some 
ten to fifteen years, will cost some millions of dollars, and will 
end up with a sample of 100,000 persons in the total popula- 
tion of 140,000,000. While this return may seem small, it far 
surpasses any effort to date, and represents the lengths to 
which one must be prepared to go for statistics on human 
sexuality. 
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Beyond the statistical tabulations, however, remains the 
problem of how sex activities reflect behavioral principles. 
A premonition of the complexity and multiple causality of 
copulation by human males, for example, can be got from 
castration studies of the male rat (Stone, 1927). The effect of 
castration on the rat depends principally on two factors, the 
age and amount of previous experience with sex at the time 
of castration. Castration before puberty eliminates copulation 
after puberty; and castration of an adult male who had been 
raised in isolation, will keep him from copulating when he is 
later given access to a female. If, however, an adult male with 
ample previous experience is castrated, his sex behavior may 
continue for some time albeit at diminished frequency. Med- 
ical literature contains very few post-castration studies of men 
injured accidentally or operated on for medical necessity. 
Those available indicate that frequency of intercourse, and 
the pleasure reported, may be little, if any, reduced by the 
operation. The action of secondary reinforcements in these 
instances shows that there are other reasons than glandular 
foi sexual intercourse, just as there are other reasons than 
stomach contractions for eating. These sexual S r 's probably 
include both exteroceptive and proprioceptive stimulation. 

Sexual Discriminations and Differentiations 
Among infra-human animals, it is frequently seen that the 
stimuli and responses which are correlated with sexual rein- 
forcement may be operative with a minimum of condition- 
ing. It is often so difficult to tell whether the stimuli are 
eliciting or discriminative, that instances have sometimes 
been called "unlearned behavior." Some European zoologists 
have simply spoken of "innate releasing mechanisms'* in their 
attempt to explain the interaction of certain animals (e.g., 
the male and female stickleback fish) in their sexual behavior. 
Animals at the mammalian level may show a similar quick 
establishment of sexual chains. The fact is that one cannot yet 
tell with assurance when he is dealing with accounts of 
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speedily conditioned operants or of respondents requiring 
no conditioning at all. 

Male rats raised in isolation from females will mount and 
copulate, without overmuch delay, when eventually paired 
with a female in heat. The S D 's for the male rat are largely 
olfactory, but anosmic males (with the sense of smell de- 
stroyed) will still copulate, since there are other S D 's -for 
example, the short, jerky movements and leaps of the female 
in oestrum. Before puberty, male rats are not responsive to 
the smell cues of oestrum nor any other S D 's from the female. 
At the human level, males respond to secondary sexual char- 
acteristics of the female, such as shape, contour, voice and 
others, as S D 's, and in addition are affected by social S D 's like 
fashionableness of dress, skill with cosmetics, flirtatiousness, 
and so on. In contrast with the rat, too, the human male may 
need some time to learn the responses involved in copulation. 

Under certain conditions, sexual discriminations and dif- 
ferentiations of higher animals, and especially man, are not 
only 'incorrect/ but may take a turn conventionally termed 
'abnormal/ These deviant responses are learned in accord- 
ance with the same laws which govern the acquisition of 
'normal' behavior, but are deemed queer because they are out 
of step with more common patterns or conflict with the codes 
of the society in which they occur. Thus, in one of Moss's 
studies, male rats were reared together for 50-150 days without 
females, with the result that they seemed to lose interest in 
females and mounted each other in preference. A similar find- 
ing has been reported for birds, with males reared together 
preferring males to females. Another example of early rein- 
forcement interfering with 'normal' sexuality is that in which 
birds raised with members of another but related species may 
later prefer them as mates rather than their own species. 
Human homosexuality involves much more intricate sets of 
personal relationships, but there is little doubt that discrim- 
inative training plays an important role in it. 

As with other motives, high sex drive may produce re- 
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sponses to stimuli which are not ordinarily sexual. Deprived 
persons may see a sexual symbolism in things like mountains 
and valleys, guns and targets, ladders and windows, and many 
other objects. Under high drive, stimuli of all sorts may gen- 
eralize as S D> s and become metaphors, as it were, of true sexual 
S D 's for evoking sex talk or action that is bizarre to an ob- 
server. A phrase with double entendre, an unconscious pun, 
a mispronounced or misspelled word, oftimes embarrassingly 
reveals the motive underlying a speaker's or writer's avowed 
intention. Such 'slips' of the tongue or pen were cited by 
Freud as the "psychopathology of everyday life," and he 
ascribed them to the workings of a ' 'subconscious mind." 
There is, however, nothing esoteric about motivational 
strengthening of reflexes, even when the person cannot tell 
us, or is 'unaware of/ his present motives. 

THE AVERSIVE DRIVES 

Aversions, like appetites, form a major class of drives. 
There are two criteria for determining whether a given 
stimulus is aversive: (i) a response made in its presence should 
be strengthened by its removal; and (2) a response made in 
the absence of the stimulus should be depressed if it is fol- 
lowed by the administration of the stimulus. You will note, 
of course, that these are also the criteria for a negative rein- 
forcer. In this chapter, however, our interest will be in the 
motivating function of such stimuli, which may, therefore, 
be named aversive stimuli. The three aversions chosen for 
comment herelight aversion, sound aversion, and shock 
aversion by no means exhaust the list. Others are known, 
still others are suspected, and the total number of them, as in 
the case of appetites, is a matter of empirical discovery for 
each species of organism. 
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Light Aversion 

This drive can be demonstrated rather easily in the white 
rat. Richter, it will be recalled, found that the rat, a 'noc- 
turnal' animal, is less active in light than in dark, suggesting 
the depressive action which we now expect of an aversive 
stimulus; and several experiments in recent years have shown 
that light-removal may constitute a positive reinforcement for 
conditioning a response (Keller, 1942). The second criterion 
of an aversion to light can be met by administering the stimu- 
lus whenever a given response is made. In a trial experiment, 
Schoenfeld (1946) used a hungry rat daily in one-hour ses- 
sions in a bar-pressing apparatus with no positive reinforce- 
ment given for pressings. When each response to the bar was 
immediately followed by five flashes of a bright light the rate 
of operant-level pressing dropped swiftly and remained low 
for some days after the punishing stimulus was discontinued. 

A combination of the above-mentioned procedures can be 
very strong in its effect. Hefferline (see page 312) rigged his 
apparatus so that a rat's depression of the bar would turn off 
a light, but release of the bar would bring it back on again. 
He reasoned that the bar-pressing response has two compo- 
nents, an 'up' and a 'down' response. The 'down' received a 
positive reinforcement (light-removal) and the 'up* a nega- 
tive (light-onset); the positive reinforcement could be ob- 
tained, and the negative reinforcement avoided, only by 
pressing and holding the bar down. The rats in this situation 
conditioned quickly to prolonged holding, so that they were 
immobilized on the bar for as much as 95 per cent of each 
experimental hour. This failure to 'unmake* a response is 
related to those preference situations in which animals move 
into, and stay in, dark rather than lighted areas, as well as to 
other instances of sticking to 'safe* environments. It also helps 
us to see why, in experiments using aversive stimuli, some 
animals cannot be conditioned at all. If a partial reinforce- 
ment can be obtained in some other way, as by hiding one's 
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head or covering one's eyes in a light-aversion situation, the 
animal may be immobilized through the combination of this 
reinforcement with the punishment consequent upon moving 
from the position with the result that a more effective rein- 
forcement is never achieved. 

An amusing aspect of the effect of combining positive with 
negative reinforcement in the manner just described is that 
provided by a simple experiment routinely carried out by 
first-year students of psychology in Columbia College. Some 
object like a metal coin or piece of wood is placed in the rat's 
cage and a bright light is turned on overhead. As soon as the 
animal, in moving about, makes contact with the coin, the 
light is diminished; and, as soon as he leaves the coin, the light 
returns. Within a few minutes, the animal shows a proto- 
typical 'fetishism': he stays close to his coin, taking it in his 
teeth, chewing on it, and carrying it about with him wherevei 
he goes. Although one would hesitate to say that the rat wor 
shipped the object for its magical powers, one cannot but be 
reminded of the rabbit's feet and lucky coins at a highef 
evolutionary level. 

Sound Aversion 

An aversion to certain sounds is found in many animals, 
including the student who is annoyed by such things as the 
squeak of chalk on a blackboard. Among rats, 'audiogenic 
seizures' are known to result from stimulation with escaping 
air, jangling keys, electric bells, and the like. These seizures 
are marked by periods of wild, erratic behavior alternating 
irregularly with trembling, prostration, and bodily rigidity; 
an unnatural lack of sensitivity to other stimulus influences 
(as when the animal is poked with a stick); and a condition 
somewhat like the "waxy flexibility" of certain patients in 
mental hospitals whose limbs and body are passive and may 
be molded into various positions. Recovery may seemingly be 
complete on the day following the seizure, but a new episode 
is more easily induced on later occasions. 
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In an exploratory experiment, two rats were trained in a 
working-box that contained a loud buzzer. In daily one-hour 
sessions, the buzzer was sounded and bar-pressings were reg- 
ularly reinforced by sound-removal for one-minute periods. 
The response was quickly strengthened and soon came to be 
very stereotyped in appearance, with the animal remaining 
close to the bar in the silent periods. The latencies of the 
response to sound, which were characteristically short, showed 
an interesting cyclical effect that may also be observed occa- 
sionally under light aversion. In a short series of stimulus 
presentations, they became gradually of shorter and shorter 
duration; then they would suddenly lengthen and begin an- 
other gradual decrease; with such lengthenings and shorten- 
ings continuing throughout most of an experimental hour. 
Apparently, the latency decreased to a point at which the 
escape response came so closely on the heels of the stimulus 
onset that the response was virtually punished, thus bringing 
about a temporary depression of the sound-removing behavior 
that showed itself in a lengthened latency. 

As in other aversion experiments, those with sound require 
rapid conditioning if the animal is not to be 'beaten down' by 
the stimulus and fail to respond in the desired manner. It 
would probably be of advantage in such experiments to use 
as the to-be-conditioned response one which is easier still than 
bar-pressing and has a higher operant level. The faster the 
conditioning, the less should be the likelihood of producing 
audiogenic seizures. 

Shock Aversion 

Electric shock is perhaps the most generally aversive of all 
stimuli for most organisms. In the rat, it easily satisfies both 
criteria for an aversive stimulus: a response removing it is 
strengthened, and one that brings it on is depressed. We may 
take Mowrer's (1940) study as an example since his results 
resembled those obtained under light aversion. He used a box 
in which electric shock to the animal's feet built up gradually 
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from zero intensity to one that was highly upsetting, with the 
'pain threshold' reached at approximately the end of the first 
minute of application. As the shock increased above this 
point, the rat became very active, even frantic. If, during this 
activity, he bumped or pushed a panel at one end of the box, 
the shock returned to the zero level and gradually built up 
again in intensity. Conditioning was rapid; the successful 
response became very stereotyped; 'freezings' to the panel 
were often observed following shock-removal. All these phe- 
nomena are equally typical of light-aversive behavior. It is 
important when using shock for conditioning, even more 
than in the case of light or sound, to avoid over-stimulation, 
else the organism may be immediately thrown into a flurry 
of convulsive activity which precludes any form of adaptive 
behavior. 

Human Aversions 

That human beings possess aversive drives seems beyond 
question. Infants, by their squirmings and waitings, and 
adults, by their verbal reports, indicate that they find some 
stimuli noxious, painful, or unpleasant. They easily learn to 
get rid of strong shocks, glaring lights, grating noises, putrid 
odors, pricking pins, and the like. Yet, when brought into the 
laboratory, their aversion is not always obvious. When we 
attempt, for example, to condition a finger-withdrawal under 
shock-aversion, our subject may thwart us by taking the shock 
without complaint unless it is of exceptional severity. Our 
experiment has perhaps not taken into account the subject's 
reinforcement history, in which there may have been given 
much positive reinforcement for stoical behavior. 

It is a commonplace of human experience that stimuli 
which were once unpleasant can lose this character and be- 
come acceptable, even sought after, as a result of training. 
Adults may like bitter and spicy foods that infants will reject; 
they come to enjoy music that is cacophony to children or 
other adults; they welcome such otherwise annoying stimuli 
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as the vibrations applied by the barber to a balding head. We 
have already mentioned such matters in our chapter on sec- 
ondary reinforcement. It is not simply that the aversion is 
outweighed by conflicting drive, but rather that the aversive 
stimulus becomes a secondary reinforcement. This is related, 
not only to our subject's failure to withdraw his finger from 
shock, but to the aging boxer's refusal to quit the ring, and 
the neurotic's masochism. 

A simple experiment, easily carried out in an undergrad- 
uate laboratory, provides the prototype of masochism. Hungry 
rats are conditioned to bar-pressing for food pellets, some 
receiving regular and others periodic reinforcement. Each 
reinforcement is accompanied by a mild electric shock. After 
an hour of such training, extinction of bar-pressing is begun, 
with neither food nor shock given for the response. When 
extinction is well advanced, the shock alone is reintroduced 
as an accompaniment of bar-pressing. The result is commonly 
a rapid acceleration in response rate, which appears as a 
sizable bump in the cumulative response curve. To an unin- 
formed bystander watching the animal's behavior, it appears 
that the rats enjoy the shocks and are deliberately punishing 
themselves 

ANXIETY AND AVOIDANCE 

Pavlov once wrote: ". . . it is evident that under natural 
conditions the animal must respond ... to stimuli which sig- 
nal the approach of harm or good. ..." We now ask what 
happens when a discriminative stimulus precedes an aversive 
one. We approached this problem in the preceding chapter, 
in our discussion of secondary negative reinforcement. Two 
experiments can be designed. In one, the S D is followed 
inevitably by the aversive stimulus; in the other, some re- 
sponse to the S D suffices to keep the aversive stimulus from 
coming. These two arrangements provide operational defini- 
tions of "anxiety" and "avoidance" respectively. 
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Anxiety 

In an investigation of anxiety, Estes and Skinner (1941) 
dealt with the problem in the following way. At intervals 
during periodic reconditioning of the bar-pressing response,, 
a soft tone came on and sounded for a period of five minutes, 
at the end of which an electric shock was delivered. The tone, 
at the outset, had no appreciable effect upon bar-pressing rate, 
but very soon it took on a markedly depressant character at 
each presentation, from which there was immediate recovery 
as soon as the shock had been given. Bar-pressing during the 
tone ceased almost entirely, with the undoubted accompani- 
ment of other reflex changes which might easily be labeled 
'anxious/ Extinction of this effect was accomplished by pre- 
senting tone continuously, without shock, during a thirty- 
three-minute period of P-R in which the rats gradually re- 
turned to their normal rate of responding. Spontaneous 
recovery of the effect, on the following day, was, however, 
practically complete, suggesting that the tone had by no 
means lost its power to set up the anxiety. 

We may speak of this anxiety as a state induced by a stimu- 
lus which, in the past, has preceded negative reinforcement. 
It arises from experience and, if it is to be classed as a drive, 
it should be regarded as secondary. It begins with an S D that 
has become a secondary negative reinforcer and ends with the 
primary aversive stimulus. When the warning is over and the 
punishment borne, normal behavior is restored reminiscent 
of the way men describe their relief when a period of anxious 
waiting is ended: "Well, that's over!" 

Anxiety is, unfortunately, more common in men than in 
rats. Daily, and in many ways, we are subjected to punish- 
ment and threats of punishment. Law-abidance and education 
are both fields in which it is often difficult to achieve results 
without recourse to such motivation. Children are all too 
often made literate by the same aversive motivation that is 
used to keep men out of jail this is the philosophy upon 
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which a whole society may stand. Is it odd that such a society 
is plagued by a high incidence of anxiety-ridden members 
who may, at one time or another in their lives, need therapeu- 
tic assistance? If threats from one's personal environment and 
society in general become excessive, an individual's every day 
and every act may be dominated by anxiety. And if, as a child, 
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FIG. 70. The effect of an anxiety-inducing stimulus on rate of bar- 
pressing under P-R. T denotes the onset of a 5-minute tone; S, the 
occurrence of shock. The four curves (A, B, C, D) are for four succes- 
sive days. The response rate declines during successive periods of anxiety 
until, by the fourth day, it is almost zero during the presentation of the 
tone. Each cumulative response curve is an average for the group of six 
rats. (After Estes and Skinner, 1941.) 

he does not know what responses may be punished, but only 
that it will be strong, frequent, and inevitable, all the ordi- 
nary stimuli about him become 'dangerous* and anxiety- 
producing. The files of psychological clinics are filled with 
cases of morbid and obsessive anxieties which are clearly the 
outcome of disciplinary and social training overburdened 
with threats of punishment. As compared with others, includ- 
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ing so-called inferior or primitive cultures, our society is 
sorely beset in this respect. 

Avoidance 

In ourselves, in others, and in many animals, we seem to see 
responses that are made to prevent the impact of noxious 
stimuli. It would appear that such avoidance responses de- 
pend on some S D or other that signals the imminent arrival of 
a negative reinforcer or, more concisely, that these responses 
are discriminative operants. But what is the reinforcement for 
them? Any answer that proposes the response's "actual success 
in avoiding the stimulus" is inadequate because it glosses over 
the difficulty itself, to wit, how the absence or non-occurrence 
of a stimulus can be reinforcing. Were it not for very strong 
evidence that avoidance can be manifested by an organism, 
one would be inclined to deny its very possibility. 

A partial answer to this question has been suggested in 
Chapter 8, where we saw that the removal of a conditioned 
negative reinforcement could act as a positive reinforcer. 
There are many situations in which we commonly speak of 
avoidance when an animal escapes from such an S D for ex- 
ample, in the experiments of Mowrer (1939) and Miller 
(1941). In these cases, the best avoidance results are apparently 
obtained when the S D is continuously present until brought 
to an end by some response. But there are other cases in which 
the S D is of short duration and the primary negative reinforce- 
ment does not follow until some time has elapsed. Warner 
(1932) taught rats to escape shock by jumping over a barrier 
in an experimental chamber. When the response was well 
established, he introduced a one-second warning buzzer that 
preceded the shock by intervals of i, 10, 20, and 30 seconds. 
Using different groups of rats with each delay interval, he 
gave fifty trials per day under these conditions for a period 
as long as twenty days. The rats in the i -second group re- 
quired, on the average, three days to meet a criterion of six 
successive avoiding responses that is, they jumped the barrier 
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within the i -second interval and did not receive the shock. In 
the lo-second group, the median animal reached this criterion 
in the fourth day and one animal failed to solve the problem. 
The so-second group required seven and a half days, with four 
animals unsuccessful; and all of the animals in the 3O-second 
group failed. In addition to demonstrating avoidance in the 

absence of a continuous external S D 
which could be terminated by the re- 
sponse, Warner's experiment shows 
us that whatever discriminative or 
noxious value the S D acquired was de- 
pendent on the temporal remoteness 
of the negatively reinforcing stim- 
ulus with which it was correlated. 
Hunter (1935), employing a simi- 
lar experimental procedure with 
white rats, sounded a buzzer for one- 
fifth of a second; two seconds later 
a shock was presented unless the 
animal jumped into an adjacent 
compartment during the period be- 
tween the two stimuli. Trials were 




FIG. 71. Four-chambered 
octagonal box used in 
studying escape and avoid- spaced by 58 seconds and a maxi- 
ance behavior by rats. The mum of lg() trfals was a n owed per 

animal for meeting a criterion of 
ten successive avoidances. Of 91 
rats, only six failed to reach the 
criterion. The average number of 
trials was a little over sixty. 
Hefferline (1950) showed that rats, with long training, 
could be taught to avoid another type of noxious stimulus, 
strong light, when it was preceded for fifteen seconds by a 
series of clicks at half-second intervals. This is hardly compar- 
able to the studies just described, since the S D was essentially 
continuous, but Hefferline was also able to show avoidance 
conclusively when no change at all occurred in the external 



animal, when shocked by 
the floor grid in any com- 
partment or warned by 
some signal, can move into 
an adjoining chamber. (Af- 
ter Hunter, 1935.) 
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situation. This was in his "holding" experiment, wherein rats 
were noted to keep a bar in a "down" position for stretches of 
forty minutes or more, when 'letting go' of the bar was always 
followed by the onset of a negatively reinforcing light. Heffer- 
line ascribed this holding behavior to the fact that proprio* 
ceptive stimuli, aroused when the rat began to let up on the 
bar, had come to be noxious and "anxiety-inducing" in them- 
selves. Maintenance of the "down-response" was thus treated 
essentially as an escape from the stimuli aroused whenever the 
"up-response" began. 

Hefferline's explanation has much in common with the 
view, expressed by Mowrer and Lamoreaux (1947), that 
avoidance behavior is reinforced by anxiety-reduction; but it 
gives us a clue as to the possible stimulus factors involved. 
Further research will undoubtedly clarify this situation, but 
at present we seem to be justified in saying that the most com- 
mon acts of avoidance are positively reinforced by the ter- 
mination of an external S D which is also a secondary negative 
reinforcer, as in the experiments by Miller, Mowrer, and others* 
(pages 250-253). Other avoidances, like that described by 
Warner, in which there is a lapse of time between the S D and 
the primary negative reinforcer, are conceivably reinforced by 
the removal of proprioceptive stimuli or other parts of the ex- 
ternal stimulus compound (Schoenfeld, 1950). For conven- 
ience of expression, we may also speak of avoidance as anxiety- 
reducing, just as we sometimes speak of a food-reinforced 
response as hunger-reducing. This accords well with the 
teachings of some psychotherapists who insist that a wide di- 
versity of behavioral symptoms are to be understood as 'escape 
mechanisms'ways of alleviating anxiety or avoiding 'un- 
pleasant thoughts or ideas.' You will correctly infer, from this 
line of reasoning, that the extinction of avoidance responding 
depends on the extinction of anxiety-inducing stimuli. So 
long as the S D is capable of exciting anxiety, just so long will 
an avoidance response which cuts off these stimuli be posi- 
tively reinforced. 



314 PRINCIPLES OF PSYCHOLOGY 

The extinction of avoidance responding may be carried out 
in two ways. First, we may have the S D always followed by the 
noxious stimulus, with the response no longer effective in re- 
moving the S D . This is, of course, a return to the original 
anxiety situation, and will result in a weakening of the 
avoidance response as well as a resumption of the customary 
anxiety symptomsdepressed activity, cowering, and so on. 
Secondly, we could have the S D presented on schedule, but 
never followed by the noxious stimulus. This case duplicates 
the situation that prevails when the organism is successfully 
avoiding; and you might be led to wonder how the animal 
would ever 4 discover' that the noxious stimulus would not 
come. You would have to recall that the reinforcement for 
avoidance behavior does not lie in the non-appearance of 
anything; rather, it lies in stimulus removal. 

Actually, extinction occurs in this second case, because the 
S D maintains its negatively reinforcing status only when it is 
at least occasionally followed by the primary reinforcer. 
Without the shock or other noxious stimulus, its power dis- 
sipates (along with its anxiety-producing function). This is 
related to the experimental observation (Schlosberg, 1936 
1937; Sheffield, 1948) that the latency of the avoidance re- 
sponse to the S D goes through a cyclical lengthening and short- 
ening. In the course of a string of successful avoidances, the 
latency increases from trial to trial as the S D 's anxiety-pro- 
ducing power extinguishes. When, finally, the response comes 
too late, the noxious stimulus strikes the animal, and acts to 
restore the S D 's power. The next latency 'snaps back* to a low 
value, only to go through another lengthening phase with 
each successful avoidance. However, in extinction, it has been 
observed (Culler, Finch, Girden, and Brogden, 1935) that if 
an avoidance response passes by the time when shock is cus- 
tomarily given, it will continue to weaken. This weakening is 
sometimes strikingly rapid and it may be that it depends 
upon the violation of a temporal discrimination built up by 
the training schedule. 
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We have a great deal to learn about avoidance behavior and 
anxiety. Our present knowledge, gleaned mainly from animal 
study, indicates that the extinction of an avoidance response ij 
often extremely difficult, even in well-controlled experimen 
tal situations. It suggests that, at the level of human behavior, 
where all manner of stimuli may, by design or caprice, ac- 
company punishment, the problem of eliminating anxiety is 
one of the first magnitude. This is especially true of our own 
society, which so wantonly, deliberately, or systematically re- 
lies on punishment in the control of behavior. Ideas on the 
subject abound, among laymen as well as professional workers 
in psychotherapy, but the scientific validation of most of them 
still lies ahead. Is it wise to send the aviator aloft immediately 
after the crash? Should we force the person with a phobia tc 
face the feared object? Can a child's dread of darkness be re- 
moved by keeping him in a dark room? These and related 
questions are of deep concern to the mental hygienist, who has 
evolved his own set of rules for the prevention or removal of 
behavioral 'maladjustments' rules that have a pragmatic 
basis in the benefits sometimes derived from their use, but 
which as yet have only been partially subjected to analysis. 

The Conflict of Drives 

The laboratory, which permits the analysis of natural 
processes into components rarely or never seen in the massive 
events of daily life, also permits us to re-create the more com- 
plex events step-by-step for deliberate study. Although we 
have so far dealt mainly with motives operating singly, it must 
be evident to all that in the usual affairs of animals and men 
more than one drive may be present at any time. Before this 
already lengthy chapter is closed it may be worthwhile to look 
briefly at some of the known effects of co-existing motives 
upon behavior. In this connection, attention has often been 
focussed upon the conflict of drives. 

A word of explanation is needed about "conflict of drives" 
lest we lose contact with our earlier discussions. A moment's 
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consideration will disclose that it is not drives that conflict, 
but rather responses or response tendencies. As Shaffer (1936) 
has put it: "The term conflict means that the individual con- 
fronts a complex situation to different aspects of which he has 
learned to make antagonistic responses . . . [which] cannot be 

made at the same time " We speak of drive conflict when 

opposition between incompatible responses arises from drive 
operations like deprivation or stimulation. This is in contrast 
with conflicts originating in other ways for example, from 
the direct stimulation of antagonistic muscles simultaneously, 
or the simultaneous use of S D 's for incompatible responses. 
Direct stimulation has been investigated mainly by physiolo- 
gists; the "conflict of cues*' (or S D 's) has been of interest in the 
study of special discrimination problems (e.g., Nissen and 
Jenkins, 1943) like sensory "illusions/* Always, however, 'con- 
flict' refers to the instigation of incompatible movements, 
movements which cannot occur together and may be thought 
of as competing. In the discussion which follows, therefore, 
our use of 'drive conflict* will mean "response conflict created 
by drive operations"; and when we speak here of 'response 
conflict' we shall intend this to mean competition that is 
drive-induced. 

The co-existence of several drives can mean harmonious 
performance if the responses involved are simultaneously pos- 
sible; or it can mean, if the responses are incompatible, the 
kind of behavioral upset that all of us have experienced at 
one time or another. The signs of conflict, which are also the 
response characteristics measured in experiments, include 
increased hesitation or latency of response, vacillation or al- 
ternation between responses, frequency of 'blocking* or 
failing to respond at all. These measures have been used by 
several researchers who, within recent years, have begun a 
promising line of experimental attack upon the problem. 

Conflict situations differ in complexity depending upon 
the number and variety of competing responses. Two experi- 
mental arrangements, however, because of their simplicity, 
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provide a good starting point. In each case let us assume that 
the response is fully conditioned, and that other variables are 
kept constant, so that only the experimenter's drive operation 
is manipulating the strengths of the responses involved. (In 
this way, we avoid the complications that responses with un- 
equal training are differently affected by the same drive op- 
eration (Sears and Hovland, 1941), and so forth. While these 
are doubtless important variables, we may for simplicity's 
sake ignore them here.) 

i . An animal is placed into an alley apparatus in which he 
is trained to move toward or away from either end by the 
positive or negative reinforcements placed in the ends. Loco- 
motion is a kind of response, and this situation is often 
described in terms of "approach and avoidance." This ar- 
rangement is valuable because it affords a good picture of 
conflicting responses and allows relatively easy measurement. 
The student should remember, however, that the spatial re- 
sults of the animal's responding are not the prime considera- 
tion, and that analysis could as well be made of responses that 
do not require travel by the organism (Miller, 1948). Brown 
(1948), using white rats and such an alley, attached a small 
harness to his animals and was able to measure the pull ex- 
erted by the animals as they tried to move toward a positive 
reinforcer (food) and away from a negative (electric shock). 
The graphs in Figure 72 show how the force of the pull varied 
with distance of the rat in the alley from the point of rein- 
forcement. The two gradients of approach and avoidance, 
each obtained from animals who had had only one type of 
reinforcement, led to the following observations (Miller, 
1944): (a) the tendency to approach a goal is stronger the 
nearer the subject is to it ("approach gradient"); (6) the ten- 
dency to go away from a place or object avoided is stronger 
the nearer the subject is to it ("avoidance gradient"); (c) the 
strength of avoidance increases more rapidly with nearness 
than does the strength of approach. With this known, we 
may go on to establish three conflictful situations by simul- 
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taneously placing reinforcement in the alley ends according 
to three combinations. 

a. Both alternatives positively reinforced. Thus, we could 
use food and water with animals both hungry and thirsty; or 
even food at both ends, with only hunger drive present. This 
is an "approach-approach" conflict like that of the mythical 
ass between two bales of hay. The ass, it is told, starved to 
death, but you will readily agree that this is not a likely out- 
come! The story's humor arises from the fact that its ingredi- 
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FIG. 72. The intersecting approach and avoidance gradients obtained 
by Brown (After Miller, in J. McV. Hunt, 1944.) In connection witlt 
these graphs, see also Figure 59, page 253. 

ents are conceptually possible, but their attainment actually 
impossible: the ass is not a point in space equidistant from 
the bales; the bales are not equal S D 's, nor do they occupy 
mere points in space. In actuality, the slightest departure from 
ideal conditions means an increasing pull in one direction 
with a correspondingly decreasing pull in the other as the 
distance from it increases (see Figure 72). Conflicts like 
these are easily resolved for the organism. When men speak 
of being torn between equally attractive courses of action, 
there is usually the unspoken element that one course will be 
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rewarding and the other not, so that there is an undercurrent 
of anxiety lest the wrong choice be made. But in a world 
where all choices are rewarded, the approach-approach situa- 
tion should produce none but fleeting conflicts. 

b. Both alternatives negatively reinforced. Thus, either 
end of the pathway might be charged to give a painful shock. 
The organism tends to retreat from either end, but cannot get 
out of the dilemma. Movement away from the center, in either 
direction, increases the repelling power of the approaching 
aversive spot, so that he is caught and suspended between 
them in an agitated and conflictful immobility. If he were 
not confined to the alley, the animal would flee from the situa- 
tion altogether, probably in a compromise direction at right 
angles to the line connecting the two aversive spots. A phe- 
nomenon similar to this was demonstrated by Hovland and 
Sears (1938) with human subjects. The instructions were to 
draw on a square sheet of paper a diagonal line from the mid- 
point of the near side to one of the far corners. Each far 
corner had a light which could be flashed at the will of the 
experimenter, and the subject was to respond to the light 
which did flash by drawing the line as quickly as he could. 
After training to both corners, a conflict was set up by flash- 
ing both lights at once. The result in many cases was a block- 
ing that caused lines to be drawn straight ahead between the 
two far corners rather than diagonally. In contrast with this, 
we can easily suppose that if an organism were ringed by an 
aversive circle with escape impossible, he would bog down at 
the center, just as in the alley where the walls prevent leaving 
the situation. The effect of hemming an organism in on all 
sides by negative reinforcements is of great interest to students 
of abnormal behavior. The "escape from reality" and "retreat 
into phantasy" which they observe in many patients seem to 
be consequences of life situations in which punishment awaits 
every move. Where there is no escape, something is bound to 
give way. 

c. One alternative positively, one negatively, reinforced. 
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Thus, a choice of one direction is favored by both its own 
positive value and the negative value of the other. This is 
clearly a combination in which no conflict resides. One has 
only to answer such questions as how the two response ten- 
dencies summate, whether arithmetically or in some other 
fashion. 

2. The alley apparatus can be used for a conflict arrange- 
ment that is even simpler than the above. This involves locat- 
ing the reinforcements, whether positive or negative, at only 
one end so that we can consider locomotion toward that end 
as a single response that may be rewarded or punished. No 
problem arises when: (a) only positive reinforcements have 
been given at that end; or (6) when only negative reinforce- 
ments have been given. In either case, there is no conflict 
about approaching, or retreating from, the place of reinforce- 
ment. The instance that most concerns us is that in which the 
animals have on some occasions been rewarded for approach- 
ing, and on other occasions punished for the same response. 
The result is that he has tendencies both to approach and 
avoid, and we should be able to deduce in part from Figure 
72 what his behavior will be like. Since the approach and 
avoidance gradients differ in steepness and the point of inter- 
section, we might expect that there will be a place in the 
alley at some distance from the end, where the animal will 
tend to come to rest. Thus, if we put the animal down in the 
alley beyond that point, he would approach until the avoid- 
ance force exceeded that of approach. His approach momen- 
tum would probably carry him past that point, whereupon 
he would slow up and begin to retreat; the retreat momentum 
would again take him too far away, and he would then begin 
to approach once more; and so on. In short, there should 
appear not only a point of equilibrium but also a vacillation 
of movement direction around that point, with the animal 
unable to advance or retreat decisively. Just this type of thing 
has been obtained experimentally (Miller, Brown, and Lipof- 
sky, 1943; Rasmussen, 1940). As Miller has written (1944), the 
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situation is reminiscent of human dilemmas like that of the 
hard-hit but bashful lover who vacillates helplessly at a dis- 
tance from his loved one; or like that of a man eager for a 
higher salary but afraid to approach a 'tough' employer. 

Of the situations outlined above, there are two that seem 
to be outstanding generators of conflict and probably contain 
the germ of the admittedly more complex conflicts of daily 
life. They are, if we may recapitulate, the double avoidance 
case where either of two alternate responses is punished, and 
the case where a single response has had both positive and 
negative reinforcement applied to it. Before taking up some 
examples of conflict that are on the next level of complexity, 
it may be noted that the amount of conflict engendered be- 
tween two response tendencies is inversely related to the 
difference in strength of the two tendencies; or, the amount 
of conflict increases as the strengths of the opposing responses 
approach equality. This has been tested and confirmed in 
several studies (Sears and Hovland, 1941; Godbeer, 1940; see 
also a review by Bitterman, 1944). Barker (1942), for example, 
gave ten-year-old boys the task of indicating which of two 
proffered liquids they would choose to drink. In one part of 
the experiment, both alternatives were unpleasant (as deter- 
mined in advance of the experiment), but differentially so. 
Barker found that the more the alternatives differed in attrac- 
tiveness, the faster the choice was made and the fewer the 
vacillations in making the final decision. This example is one 
of the double avoidance type, but the available evidence 
shows that the hypothesis is probably valid for a single re- 
sponse with a combined history of reward and punishment 
(Miller, 1944). 

Our final case of conflict involves two incompatible reac- 
tions, either of which is reinforced but at the same time 
results in the loss of the other reinforcement or is punished. 
In other words, one positive reinforcement must be foregone 
(or punishment endured) when the subject chooses the other. 
The situation comprises two interacting instances of a single 
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response that is both rewarded and punished. It has been reo 
ognized (e.g., Hovland and Sears, 1938) that the case resembles 
more closely the conflictful contingencies of real life, and it 
has been studied by several investigators. In a series of experi- 
ments with conflicting light-aversion and hunger drives in 
the white rat, Tolcott (1949) employed an apparatus in which 
two incompatible responses were available for separate rein- 
forcement with darkness or food. Located at one end of a box 
was a bar that, when pressed, delivered food pellets, while at 
the other end was a platform or pedal which controlled a 
flashing bright light placed directly over the box. The animals 
were first trained, in separate preliminary sessions, to procure 
food by bar-pressing and to keep the light off by stepping or 
sitting on the platform, when getting off the platform meant 
the immediate return of the light. The bar and platform were 
separated by a distance greater than the length of the animal's 
body, so that in the conflict sessions when they were hungry 
they could not maintain platform contact while stretching 
over to press the bar at the same time (as many tried to do). 
In one experiment, Tolcott compared the behavior of his 
subjects in the conflict situation with that under hunger alone 
and under light aversion alone. Two of his interesting find- 
ings may be quoted. 

1. When the two drives are aroused simultaneously (conflict) the 
typical behavior pattern is an alternation between the responses short 
periods of pedal-holding alternate with brief flurries of bar-pressing. 

2. During these conflict sessions, more than half the experimental 
interval is spent on the pedal; nevertheless the same number of bar- 
responses are made as occurred during the sessions of appetite alone. 
This is accomplished by an increase in the net rate, or rate at which 
the bar is pressed during the spurts of bar-pressing. 

The latter is an interesting point and is in line with the find- 
ing that punishment is not necessarily sufficient to reduce the 
strength of a response. Indeed, it looks as if it is this very 
inefficacy of punishment that allows the conflict to continue 
unabated.. 
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In another experiment, Tolcott lessened the degree of con^ 
flict systematically by lowering the hunger drive, and com- 
pared behavior under the various strengths of hunger as it 
competed with the light aversion that was held constant. The 
use of light aversion was somewhat different here, since it 
involved the periodic and signalled presentation of light 
rather than continuous light. He was able to demonstrate that 
with increased pre-feeding, the warning signal for onset of 
light came to have a greater effect. Lowered hunger meant less 
conflict, more frequent returns to the platform in the warning 
period, less vacillation between the two incompatible re- 
sponses, and so on. The data, therefore, once more support 
the hypothesis that the amount of conflict is inversely related 
to the difference in strength of competing incompatible 
responses. 

A later study by Winnick (1950) produced data that are 
significant for our understanding of conflictful behavior, and 
that are relevant to our earlier discussion of experiments like 
Hefferline's on avoidance behavior. She replaced the platform 
of Tolcott's apparatus with a small vertical hinged panel 
against which the rat pushed to keep off the aversive light. 
The panel could be pushed some distance beyond the point 
at which a switch broke the light circuit, and a pen was at- 
tached to the panel that gave a continuous record of its 
movements on a kymograph tape. As before, a bar at the other 
end of the box, whereby food could be obtained, created a 
conflict situation when the animals were placed in the box 
hungry. As Tolcott had observed, animals vacillated between 
panel-pushing and bar-pressing in the conflict sessions. But 
the record of panel-pushing revealed that conflict was present 
even when no oscillation was apparent to the eye. The 
panel-pushing was not steady in force or extent, but was 
marked by large variations even though the animal might not 
for a long time release it sufficiently to allow the light to come 
on. Incipient movements toward the bar that stopped short 
of the light-switching-on point, alternated with retreats from 
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the switching-on point and pushing with renewed vigor. Fig- 
ure 73 is a sample bit o Winnick's records of this conflict. 

This ingenious experiment deserves to be followed up 
because of what it reveals about conflict and the way it bridges 
the topics of proprioceptive S D 's, covert response, anxiety and 
avoidance, and conflict behavior. The beginnings of one re- 
sponse, as we have said elsewhere, can generate proprioceptive 
S D 's which serve as anxiety arousers because they are cor- 







FIG. 73. Sample records of panel-pushing behavior by rats in a con- 
flict situation. When the recording pen was at the base line, the aversive 
stimulus (light) was present. The animals were hungry and the jagged 
records show that, though they often kept the light off, there was con- 
siderable vacillation of pushing behavior because the food-getting bar 
was present at the opposite end of the response chamber. The records 
are to be read from right to left. Three 4-minute samples of behavior 
are shown. (From Winnick, 1950.) 

related with ensuing punishment if the response goes to com- 
pletion. In this way, anxiety S D 's of muscular origin give rise 
to Inhibition' or 'repression' of behavior. A similar emphasis 
on the role of proprioception is to be found in the writings 
and actual therapeutic practices of some psychoanalysts (e.g., 
Wilhelm Reich, 1949). We shall not digress into this area, but 
there can be little doubt that the work of the clinician awaits 
the fruitful touch of the laboratory scientist. Studies like 
those of Luria (1932), Godbeer (1940), and others, show that 
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laboratory studies with human subjects in conflict situations 
are feasible. Some of the findings with animals have already 
been corroborated with human beings. In the end, however, 
our knowledge of man, though it may start with humbler 
organisms, must be rounded out by studying man himself. 

NOTES 

Our chapter has been long, but it would have been longer if we 
had paid attention to non-experimental studies of motivation, especially 
to the work of Sigmund Freud (1856-1939) who, more than any other, 
is identified with the modern emphasis upon the sexual drive in human 
beings. For Freud, this drive was the foundation for his general theory 
of psychoanalysis, the basic tenets of which are now available in many 
books and articles written for laymen as well as students and profes- 
sional psychotherapists. We cannot go into the intricacies of this theory 
here, but we may note that many of its concepts (e.g., regression, sym- 
bolism, and repression) have found their parallels in experimental re- 
search, even at the sub-human level. The future may show that this 
development has even more points of contact with behavioral science 
than are now apparent. 

The best single text on motivation to which we can direct you is 
P. T. Young's Motivation of behavior (1936) which contains an excel- 
lent account of the research in this field prior to 1936. Young has 
himself made significant experimental contributions to our knowledge, 
especially in studying the sub-hungers. 

The distinction between appetites and aversions is an old one, going 
back at least to the time of Thomas Hobbes (1651). Young, in the book 
mentioned above, treats these as two classes of drives, and so does 
E. C. Tolman, in a thought-provoking little book called Drives toward 
war (1942). Tolman's book, which leans heavily upon observations of 
animal behavior, also includes some interesting speculations concerning 
the relation of basic motives to several psychoanalytic concepts and to 
certain sociological and economic teachings. 
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EMOTION 

THE TYPICAL varieties of emotion are each connected with certain charac- 
teristic . . . trends of activity. Anger involves a tendency to destroy and 
forcibly to break down opposition. . . . Joy involves what we may call 
expansive activity. ... In grief there is a general depression and dis- 
turbance of the vital functions. . . . Fear . . . arises in a situation which 
demands action for averting, evading, or escaping a loss or misfortune 
which has not yet taken place. 

G. F. Stout, The Groundwork of Psychology, 1903 

A Preliminary Question 

In this chapter, as in the preceding, we are concerned, not 
with a principle, but a problem. The problem, in large part, 
is that of deciding what to do about the term emotion. Is this 
word to be kept in service as an aid to our understanding of 
behavior, or should we retire it from active scientific duty? 
The question may strike you as a foolish one. Emotion, one 
hears, is something that colors human life from birth until 
death. It is the essence of pleasure and the companion of pain; 
it is the spirit of ecstasy and the soul of despair; it is the friend 
of creative effort; it promotes well-being and ruins digestion. 
Do without it? Would we not be left with a colorless and cold 
existence? 

This attitude misses the point of our problem. We did not 
propose to shirk our obligation to deal with those aspects of 
behavior which, in common speech throughout the ages, have 
been called 'emotional.' We asked only if a clear and useful 
meaning could be given to the word. What we sought to em- 
phasize was the danger that lies in adopting, for scientific 
purposes, a term that has such a variegated history of usage 
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as this one. Other terms of ancient lineage, such as instinct, 
have suffered from the same defect, and have been dropped 
from the vocabulary of many present-day psychologists. Still 
others seem destined for a similar fate. If we are to retain this 
one (as we really intend to do), we shall have to hedge it about 
carefully. We must restrict and qualify its use in a way that 
leaves no room for misunderstanding. At the same time, we 
must not strip it so bare of meaning that it no longer pertains 
to the countless observations from which it originally 
stemmed. 

Some Observations 

It has been profitable for us, in preceding chapters, to take, 
as a starting-point for discussion, the bar-pressing behavior of 
the white rat. Suppose we return to the laboratory again, this 
time with an eye to any details of his responding which we 
may have slighted in our earlier descriptions. What will we 
find that is relevant to the topic now at hand? 

A number of items immediately present themselves. We 
note, for example, that an initial period of 'acclimatization' to 
the experimental situation is generally provided before any 
attempt is made to condition the bar-pressing response. With- 
out such a period, or periods, the time required to set up the 
habit is often quite prolonged. Familiarizing procedures are 
used in almost every type of animal experimentation. 'Gen- 
tling' subjects before training; giving them unreinforced 
'pre-trials' in runways and mazes; acquainting them with 
food-boxes; accustoming them to clicks, buzzes, and other 
apparatus noisesall these devices are conducive to rapid 
learning. A similar effect has also resulted from the use of 
sound-proof experimental rooms, constant conditions of 
illumination, and so forth. In the absence of such precautions, 
rats are not only slowed down in their rate of learning, but 
may also be seen to react in unusual ways. When introduced 
to the experimental situation, they may wash or groom exces- 
sively, crouch in corners, and void urine or feces. Movements 
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of exploration may be slow and cautious or absent entirely; 
and the slightest alteration of stimulus conditions may startle 
them or cause them to 'freeze* in position. 

A second item of interest turns up in connection with the 
extinction of Type R responses after a period of regular 
reinforcement. Here, as we remarked earlier (page 71), three 
things are obvious: (i) an initial high rate of bar-pressing; 
(2) recurrent periods in which bar-pressing may be com- 
pletely absent or much reduced in frequency; and (3) cor- 
responding periods of rapid responding which seem to 
compensate, in decreasing measure, for the periods of no 
response (see Figure 16). If we watch the animal closely, 
again we note bar-gnawing, urination, defecation, grooming, 
and other responses, together with attempts to get out of the 
situation. These responses, and the cyclical deviations in bar- 
pressing rate, are not so typical of extinctions that follow 
periodic (or aperiodic) reconditioning (see page 90). 

Easily observed changes of a similar sort appear when some 
negatively reinforcing stimulus is applied to our animals 
during periods of reinforced or non-reinforced bar-pressing. 
Besides the more or less complete suppression of the operant 
itself when the animal is slapped or shocked, he shows crouch- 
ing, climbing, jumping, approaching and retreating, rapid 
breathing. These and other movements are characteristic of 
his behavior when the negatively reinforcing stimuli are fre- 
quent or intense. 

Weak but clearly discernible effects upon bar-pressing rate 
may appear under other experimental conditions. The intro- 
duction of a dim light within the response chamber will reg- 
ularly produce a lowered frequency of response during P-R 
sessions; and even a mild tone or buzz, to be used as an S D or 
an S A in a discrimination experiment, may halt responses 
completely during their first few presentations. Extreme in- 
tensities of these stimuli may render the animal incapable of 
any movement whatever. They are no longer useful as S D 's, 
nor can they serve as aversive stimuli, the termination of 
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which is reinforcing the animal cannot make the coordinated 
movements which would remove them. 

These behavioral changes are not peculiar to any one 
experimental situation. Many studies have been made of the 
effects of lights, tones, shocks, withdrawal of reinforcement, 
and so forth, upon other animals than the rat, and in relation 
to other activities than bar-pressing. Our own observations 
merely confirm those which have been described by various 
investigators. 

It requires no great imagination for one to draw up com- 
parable examples of his own or others' behavior. Who has not 
been upset in strange surroundings or in the face of unex- 
pected change? Who has not been frustrated by his inability 
to meet a familiar situation with an accustomed mode of 
action? And who has not experienced, at least for a moment, 
the behavioral disruption which can follow an injury, an 
explosion, a loss of equilibrium, or some other form of sud- 
den, intense stimulation? We would hardly say that our rat's 
behavior is unique. 

Such phenomena as those we have described above lead us 
to a few general statements. Certain experimental operations, 
such as the taking away of positive reinforcement or the giv- 
ing of negative, result in certain operant and respondent 
changes. These effects sometimes fade out gradually with con- 
tinued exposure to the situation or with removal of the 
noxious stimuli. When one observes all or some of them, he 
feels fairly safe in saying that emotion exists. This is not the 
entire story, but it gives us a point of departure for what 
follows. 

Emotion and Drive 

If we were to define emotion tentatively, and broadly, as a 
state inferred from the change in strength of a number of re- 
flexes as a result of certain operations, we should not only 
epitomize the foregoing discussion, but we should also be 
making a statement that applies equally well to the concept of 
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drive as we treated it in the preceding chapter. Moreover, we 
would not be the first to recognize this similarity. Many stu- 
dents, ancient and modern, have noted as much. Descartes, 
in the seventeenth century, argued that "the principal effect 
of the passions [emotions] in man is that they incite and 
dispose [motivate] the mind to will the things to which they 
prepare the body." William James (1890) asserted that "in- 
stinctive reactions [read motives] . . . shade imperceptibly into 
each other. Every object that excites an instinct excites an 
emotion as well...." And William McDougall (1923) pos- 
tulated for most of the basic 'instincts 1 (again, motives) a 
closely related emotional counterpartflight, an 'instinctive' 
impulse, was associated with fear, an "affective state"; pug- 
nacity was associated with the emotion of anger; and so on. 

In our own discussion of aversions, it must have occurred 
to you that the effects of such stimuli as electric shocks were 
certainly 'emotional' in the everyday sense of the word; and 
to this we are forced to agree. In addition, we would have to 
admit that the scientific, no less than the popular, distinction 
between emotion and aversion is perhaps insupportable. The 
line of demarcation is at best very faintly drawn. The opera- 
tion by which we define aversion is that of presenting stimuli 
-shocks, slaps, lights, tones, and so on; but this is also one of 
the operations we use to define emotion. One difference be- 
tween aversion and emotion seems to lie in the intensity of 
stimuli that we present. 

This difference leads us to look at the matter in another 
Way. On page 232 we stated that stimuli have different func- 
tions. Beyond their reinforcing status, they may be elicitative 
(the respondent case), discriminative (setting the occasion for 
reinforcement of an operant), drive-inducing (aversive), 
and emotionalizing. We now suggest that the intensity con- 
tinuum cannot be ignored. Mild or moderately intense 
stimuli readily become S D 's; stronger, negatively reinforcing 
stimuli, the removal of which is positively reinforcing, may 
be called aversive; and extreme intensities are those we com- 
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monly treat as emotional. Take a weak light as an S D (a shock 
or tone would serve as well). A rat can be easily trained to 
press a bar whenever such a stimulus is presented. If, how- 
ever, we increase the intensity of this stimulus, we have 
trouble in separating the purely discriminative or eliciting 
status from the aversive that is, we find that the animal can 
be taught to turn off this light. Now, if the intensity is still 
further increased, the motivating function of the stimulus is 
obscured the animal may respond for a while in an adaptive 
fashion, but gradually his behavior becomes more and more 
disorganized. He runs this way and that; he leaps and climbs 
and falls about in his cage; and he may even end by lying 
prostrate on the floor, unable to make any movement that 
would free him from his plight. 

We begin to see, then, why it is that emotions and drives 
have so long and so persistently been confused with each 
other. Ultimately, we may decide that the fence between the 
two concepts does not justify the labor spent in its upkeep. 
For the present, however, we shall assume that the suggested 
difference in operations and behavior are great enough to 
warrant a distinction. 

Emotion as an Inferred State 

In our tentative definition, we spoke of emotion as an 
'inferred state/ We should now add, as in our discussion of 
drive, that such inferring is convenient, but not strictly nec- 
essary, and sometimes dangerous. If we slip into the practice 
of assigning to this hypothetical state any properties not given 
in our observations, we jeopardize the clarity of our thinking. 
It is especially necessary to avoid treating the state as some- 
thing 'mental' that causes our behavior. In so doing, we may 
be merely trying to lift ourselves by our own bootstraps we 
infer something from behavior which we then use to explain 
the very behavior from which the inference came. 

When we think of the state as physiological or neurological, 
rather than mental, we would seem to be on better ground. 
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There is indisputable evidence of widespread organic changes 
which accompany responses regularly called emotional. Yet, 
even here we must be wary. Our lack of detailed knowledge, 
both physiological and behavioral, does not permit us to go 
very far in relating the two areas. We are on much better 
footing at present if we consider an emotional state as a be- 
havioral construct, keeping uppermost in mind the opera- 
tions that provide for changes in response. 

Three Theories 

To illustrate the complexity of our problem, with reference 
to the points just made, let us consider briefly three ways in 
which the emotion of 'fear' has been treated by psychologists 
and physiologists. A very old, and still popular, view runs 
something like this. Fear is an episodic mental state an un- 
pleasant, all-pervasive, and 'stirred-up' consciousness, known 
at first hand only to its possessor. It is aroused by certain 
objects, events, or ideas; and it is expressed by various 'in- 
voluntary' or Voluntary* changes in behavior. It arises nat- 
urally, as when one quakes at a loud sound heard in the night, 
or through training, as when one trembles at the prospect of 
making a speech before a large audience. In its extreme form, 
terror, its expression is very dramatic. The eyes and the 
mouth open wide; the skin pales and cools and sweats; breath- 
lessness occurs, and the heart beats against the ribs; saliva 
stops flowing and the mouth becomes dry and sticky; the voice 
becomes husky and indistinct, or fails entirely; the pupils of 
the eyes enlarge and the hairs stand erect on the head and 
body surface. The normal digestive functions are halted and 
bowel or bladder control may be lost. The hands may be 
raised high, flight may ensue, or one may stand, trembling, 
rooted to the spot. 

In this classical picture, the sequence of events is (i) stimu- 
lation, (2) subjectively 'felt 1 emotion, and (3) bodily 'expres- 
sion' behavior. The same sequence, with differences in 
content, would be characteristic of joy, sorrow, anger, and 
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other time-honored categories. (We ignore, for the present, 
the problem of how an 'idea' may be a stimulus, how a 
'mental' state may be known except through behavior, or why 
'introspective' descriptions of the emotions seem always to 
point to S D 's arising from the bodily response itself.) 

William James, in 1884 an d later, proposed a drastic revi- 
sion of the earlier theory. Supporting his view by an appeal to 
a similar position arrived at by the Danish physiologist, 
Lange, he argued that the felt emotion follows, rather than 
precedes, the bodily expression. For him, the stimulating 
object or event led first to the expressive movements which 
themselves produced the 'awareness' of the emotion. "Com- 
mon sense says, we lose our fortune, are sorry and weep; we 
meet a bear, are frightened and run; we are insulted by a 
rival, are angry and strike. . . . The more rational statement is 
that we feel sorry because we cry, angry because we strike, 
afraid because we tremble." Stated more baldly than James 
would perhaps have liked, his position was this: a stimulus 
(e.g., a bear) sometimes elicits widespread changes in a per- 
son's behavior, and these changes in turn become discrimina- 
tive stimuli which prompt us, or the individual himself, to 
assert that fear exists. 

The James-Lange doctrine no longer upsets us. The objec- 
tively oriented person, trained to distinguish between what 
is observed and what is merely inferred, even finds it easier to 
understand than its predecessor. But a vigorous attempt has 
been made to turn our thoughts in a third direction toward 
brain physiology. In 1927, Walter B. Cannon and two asso- 
ciates demonstrated that the surgical removal of the sym- 
pathetic division of the autonomic nervous system did not 
eliminate the overt responses 'expressive' of rage, fear, or joy 
in cats. The same investigator adduced evidence to show that 
there are no response changes (in heart-rate, digestive activity, 
sweating, hair-erection, and the like) which are distinctive for 
the various emotions. All of them are characteristic of strong 
excitement, and many of them are present in fever, strong 
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exercise, and other presumably non-emotional states. From 
such considerations, Cannon was led to reject the James- 
Lange view that emotion, in general, is the 'awareness' of 
response, and that different response patterns give rise to the 
specific emotions of rage, fear, and so on. Instead, he offered a 
theory which relates the emotional 'consciousness* to the 
activity of a special brain center (the thalamus) which may be 
excited by stimuli independently of any expressive reactions 
of the muscles or glands. 

We need not enter into the intricacies of these theories or 
the debates which have revolved about them. Rather, let us 
remind ourselves that all three suffer from the defect that we 
described in the preceding section. In ascribing either a 
'mental' or a physiological character to emotion, they do not 
clarify our thought. The basic facts are these: something is 
done to an organism a stimulus is presented or withdrawn, a 
reinforcement is given or taken away; and, as a result, wide- 
spread changes in reflex strength occur. Some of these changes, 
like those of digestive activity, are detectible only with the 
tools of the physiologist; others, like the depressions or accel- 
erations of on-going operants, are visible to anyone's naked 
eye. In order to circumvent the round-aboutness of this ac- 
count, we say that the organism is emotional. To assert that 
the emotion is mental, does not advance our scientific under- 
standing, since there is no known procedure for investigating 
the mental as distinct from the behavioral. To say that it is 
physiological, has more justification, since we have some 
independent evidence of physiological changes which accom- 
pany the behavioral, but we still have a long road to travel 
before all the physiological 'correlates' of behavior are deter- 
mined. Certainly, the prediction and control of behavior is 
not far advanced by the assumption that either a 'felt' emotion 
(mental) or its 'expression' (behavioral) is caused by brain 
processes (physiological). 
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Response Patterns and Emotion 

For many generations, men have treated certain responses 
or groups of responses as indicative of emotion; and, down 
through the years, attempts have regularly been made to 
distinguish one emotion from another on such a basis. The 
classical theory was oriented in this direction, and so was the 
James-Lange view that followed it. Each major emotion was 
to be defined, at least in part, in terms of a pattern of bodily 
changes, alike in all individuals, with components that were 
presumably apparent to any naive observer. It behooves us, 
therefore, to ask whether this practice is sound, whether emo- 
tions can be described as specific response patterns. 

i. Facial expression. The answer to this question is not as 
simple as you might think. It seems obvious that the face of 
an enraged man is unlike that of a frightened one; and, seem- 
ingly, no one could mistake the face of joy for that of sorrow. 
Yet, when we look at the matter closely, we find ourselves in 
trouble. Is my expression of fear or joy like yours? Does the 
anger of the Asiatic resemble that of the European? Are these 
expressions 'natural' or are they 'acquired'? How can we tell 
whether they are 'real' or 'feigned'? Are the responses "uni- 
versal and immutable," as argued by Duchenne (1862), or was 
William James right in saying that "everyone of us, almost, 
has some personal idiosyncrasy of expression?" 

The history of facial-expression studies gives us a rather 
confused picture, but one or two conclusions may today be 
drawn with some assurance. First, it would appear that a cer* 
tain limited number of facial response patterns are recogniz- 
able in human children shortly after birth. A clearly distin- 
guishable smile, which precedes and is closely related to the 
laughing response, may be evoked after two to fifteen weeks 
of infancy (Jones, 1926). It has been pointed out by Young 
(1943) that both smiling and laughing tend to occur with 
(a) the satisfaction of hunger or thirst, (b) relief from dis- 
comfort, (c) presentation of certain colors, sounds, tastes and 



336 PRINCIPLES OF PSYCHOLOGY 

contacts, and (d) the free movements of play or dancing. In a 
word, they come as a consequence of presenting positive rein- 
forcers or removing negative reinforcers. Frowning, which is 
related to crying as the smile is related to the laugh, also 
occurs in infancy as a recognizable pattern. Young says that 
this response accompanies (a) hunger or thirst, (b) painful 
stimulation, (c) dazzling lights, loud sounds, or other intense 
stimuli, and (d) frustration or blockage of response. That is 
to say, negative reinforcement or the withdrawal of positive 
reinforcement. 

Secondly, we note that such expressions can apparently be 
strengthened and modified operant-wise. The child, we say, 
comes to 'use' an expression 'for his own purposes/ He de- 
velops an 'ingratiating' or a 'disarming* smile; he 'gets his 
own way' by crying; and so on. His facial expression, like his 
vocal, is shaped in one way or another by the selective rein- 
forcement which his family, his friends, and others provide. 
The 'natural' smile of babyhood may be displaced almost 
entirely by a smile that is 'artificial,' 'studied,' or 'forced/ So 
powerful are these strengthening influences that the naive 
expression is often obscured. In some adults, a fairly complete 
immobility of features for example, the 'dead pan' of the 
professional card-playermay be achieved when it is of advan- 
tage to do so. Or, at the other extreme, as in the expressions 
of the skilled actor, a variety of subtle variations of response 
may be developed through their effectiveness in molding the 
reactions of others. As with many common forms of behavior, 
facial responses come under the control of the organism's 
social environment. It should, therefore, occasion little sur- 
prise to find that the facial habits of adult Chinese are as 
different from our own as forks are different from chopsticks 
(Klineberg, 1940). 

Thirdly, we may say that, as far as the adult human being is 
concerned, facial expressions are poor indices of the prevail- 
ing emotional state. A well-known study by Landis (1924) 
points this up rather dramatically. Mature human subjects 
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were presented with a variety of presumably emotion-arousing 
stimuli (electric shocks, explosions, etc.). Resultant changes 
in pulse, respiration, and heart-beat were recorded and, in 
addition, photographs were made of the facial expression of 
each subject in each emotional situation. Analysis of these 




Fie. 74. Two representations of "startle," an unconditioned pattern 
of muscular responses to such stimuli as pistol shots. This is one of the 
few relatively stable 'emotional* patterns in human adults. This charac- 
teristic spasm is over in less than a second. (From Landis and Hunt, 
The startle pattern [Farrar & Rinehart, 1939].) 

photographs showed that there was no facial expression typ- 
ical of any stimulus situation or any verbally reported emo- 
tion. Nor was there any relation between reported emotion 
and the recorded internal changes. Landis was led to conclude 
that the behavioral distinction between emotions lay, not in 
the facial response pattern or the organic changes, but in 
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the nature of the stimulus situation and the degree of general 
disturbance evoked. Verbal reports of such states as surprise, 
anger, exasperation, and disgust were paralleled mainly by a 
decrease in the amount of facial movement, in the order here 
named; and each individual characteristically used no more 
than two or three distinct patterns of expression in all the 
situations to which he was exposed. If we consider these re- 
ports as discriminative operants based upon similar states in 
all the subjects, we can only conclude, with Landis, that facial 
expression is a very untrustworthy guide to the type of emo- 
tion involved, at least for adults. 

2. Patterns in infancy. The story of infant expressions of 
emotion is much the same. Except for such patterns of re- 
sponse as laughing and crying we look in vain for specific 
facial or bodily movements that will permit us to distinguish 
between emotional and non-emotional behavior or between 
one emotion and another. Watson (1924), from his study of 
infant responses to loud sounds, bodily restraint, fondling, 
and other forms of stimulation, was led to argue that three 
basic patterns could be distinguished in newborn babies, and 
he named these patterns fear, rage, and love. Today, however, 
in the light of similar studies by other investigators, we ques- 
tion Watson's view. Sherman (1928), for example, has shown 
us that the observation of response patterns alone (as 
when the stimulus conditions are deleted from moving-pic- 
ture records of the experiments) provide for a great deal of 
disagreement among observers as to the emotions actually 
displayed. Unanimity of judgment by nurses, medical stu- 
dents, and others, depended largely upon their knowledge of 
the circumstances in which the behavior took place. In this 
field, as in that of adult facial expression or of 'physiological' 
reactions (blood-pressure, respiratory, and other changes), the 
attempt to find uniform emotional patterns of response has 
been singularly unsuccessful. 

You may have anticipated such findings because of the way 
in which we approached our problem. We said that emotion 
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is to be distinguished from motivation on the basis of the 
operations involved and the widespread changes in reflex 
strength observed, but we did not attempt to specify any par- 
ticular responses or patterns of response that were uniquely 
emotional in their nature. Perhaps there are such, observable 
at the sub-human level and in the early years of human 
growth. A pattern of 'rage,' for example, has been observed in 
cats, dogs, and other animals, and we have already mentioned 
the smiling and frowning responses of babies. But there is 
little reason to believe that we shall ever be able to classify 
man's emotions exclusively in terms of his behavior. 

Table XII 

THE IDENTIFICATION BY ADULTS OF EMOTIONS "EXPRESSED" BY INFANTS 
(Data from Sherman, 1928) 



Names 
Given 


Initiating Conditions 


Not Shown 


Shown 


Hunger 


Drop- 
ping 


Re- 
straint 


Needle- 
prick- 
ing 


Hunger 


Drop- 
ping 


Re- 
straint 


Needle- 
prick- 
ing 


Anger 


13 


15 


16 


9 


*5 


5 


29 


3 


Fear 


7 


5 


5 


9 


o 


*7 


4 


7 


Hunger 


7 


6 


2 


2 


7 





o 


o 


Pain 


5 


3 


4 


3 


2 


2 


1 


13 



Some of the names given by graduate students in psychology to 
infant "emotional" behavior (as seen in moving pictures) when (a) 
the initiating conditions were shown (or named, as for "hunger") and 
(b) when the initiating conditions were not shown. The table may be 
read as follows: 13 students called the infant's behavior "anger" when 
the behavior was aroused by "hunger" but the initiating conditions were 
not shown, etc. 
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Specific Emotions 

What, then, can we say about the traditional categories of 
emotion that we commonly respect in our everyday speech 
and which have been the subject matter of so many psycholog- 
ical treatises? Are we justified in talking of fear, rage, joy, 
sorrow, and the like, when they have never been defined to 
the satisfaction of the objectively oriented scientist? It will 
not do, in answering these questions, to argue that, since we 
have used such words or their equivalents down through the 
ages and in many lands, they must represent fundamental 
realities; nor will it do to appeal to personal 'experience' or 
'consciousness' of these different emotions. Error, no less than 
truth, may persist for long periods of time; and one's report 
of an 'experienced' emotion does not ordinarily tell us what 
S D 's determined it. We might rather take the same path that 
we followed in our treatment of motivation. Let us ask about 
the effect of specific operations upon the changes in strength 
of groups of reflexes. Perhaps, in the end, we shall be able to 
justify the popular distinctions, but we ought not to begin 
by accepting them too readily. 

i. Fear. In our discussion of anxiety in Chapter 9, it may 
have struck you that we could as easily have used the word 
fear for this inferred state and called it emotional. This is, 
indeed, the case. It is difficult to draw a sharp line between the 
motivational and emotional aspects of anxiety, and the differ- 
ence between fear and anxiety is hardly worth arguing about. 
The important thing is not the name that we use but the fact 
that some originally neutral stimulus comes to be a secondary 
negative reinforcer and alters the probability that certain re- 
sponses will occur. A distinction between fear as emotion and 
as drive is possibly useful in some cases, as when the organism 
seems unable to make the responses which would remove the 
activating stimuli. Fear then qualifies as distinctly emotional 
and is sometimes called terror. However, we could study the 
matter in detail without ever mentioning any of these words; 
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the experimental operations and the behavioral changes are 
the truly significant things. If we choose to apply these clas- 
sical terms, it is only because serious students of motivation 
and emotion have often used them in connection with situa- 
tions and behavior that are indistinguishable from those with 
which we are concerned. 

From the earliest days of theorizing about fear, emphasis 
has been placed upon responses mediated by the autonomic 
nervous system respondents, like paling, sweating, shivering, 
and so on. But these responses, as Cannon has made clear, are 
not sufficient to define the emotion adequately. They may 
occur in other states, some of which, like fever, would not be 
called emotional. The changes in strength which help us 
most are those depressions of on-going operant behavior 
which are obvious to any observer. The organism shows a 
marked decrease in the frequency of many of his normal, 
everyday responses. The rat, for example, at the sound of the 
tone which was followed by shock in the Estes-Skinner experi- 
ment (page 309), no longer pressed the bar at his accustomed 
rate; and we have reason to believe that all of his other 
operant behavior in the experimental chamber was similarly 
affected. The human being, in a comparable situation, talks 
less, makes fewer movements, is hyper-sensitive to small 
stimulus changes, approaches new objects gingerly, or brings 
his routine activities to a complete standstill. These are some 
of the changes which serve as S D 's for us in ascribing fear to 
another. 

The operation we have selected in defining fear is the 
presentation of a secondary negative reinforcer. It is unlikely 
that this is the only effective condition. Primary negative rein- 
forcers (shocks, loud sounds, etc.) may have the same be- 
havior outcome; and there may be still other inducing factors. 
It is best, however, to leave it for future research to broaden 
the group of operations and to make further distinctions. 

In line with the present analysis, any stimulus situation may 
become fear-instigating if associated with aversive stimuli. 
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This has long been recognized by specialists in the study of 
morbid fears or 'phobias/ Scarcely an object or event can be 
named which has not been for some person at some time a 
source of overpowering fear. Sharp objects, open spaces, run- 
ning water, horses, dirt, automobiles these, and countless 
other components of our everyday outside world may come, 
through conditioning, to alter severely our response ten- 
dencies. Even internal, response-produced stimuli may be 
effective, as when we are said to fear a 'thought/ an 'idea/ or 
the performance of some act. When the circumstances of life 
have conspired to give a negatively reinforcing status to too 
many of the aspects of the environment, outer or inner, it 
may even make sense to speak of 'generalized fear/ 

2. Anger. When positive reinforcement is suddenly with- 
drawn, a rat's bar-pressing, as well as his general activity in 
the experimental situation, will show a quick acceleration, 
and he may even be seen to 'attack* the bar vigorously with 
his teeth. Pigeons, when crowded into narrow quarters or 
confronted by trespassers upon their well-staked-out preserves, 
will fight each other viciously (Craig, 1921). Other animals 
respond similarly in similar situations. Hebb (1946) has re- 
cently given us an especially interesting account of chim- 
panzee behavior, from which it appears that the breaking of 
an operant chain is a fundamental factor in producing sudden 
and violent attack behavior, self-injury (pounding the head 
on the floor), screaming, and, on some occasions, a response 
depression or 'negativity' which may reach the point where 
the ape will turn away entirely from some formerly positive 
reinforcer. Hebb applies the words rage, temper tantrums, 
and sulking to such behavior, and speaks of the initiating 
circumstances as "the removal of a desired object," the "fail- 
ure to get something," and the "frustration of an expecta- 
tion." The breaking of a chain of on-going responses may, 
however, apparently be effected by the withdrawal of positive 
reinforcers and the presentation of negative. Thus, Hebb 
speaks also of teasing, as when one ape repeatedly interrupts 
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another's activity-in-progress by screaming, spitting at him, or 
making threatening gestures. (It seems obvious that 'teasing' 
might also be applied to those cases in which an 'expectation* 
is set up and then 'frustrated/) 

In his treatment of infant emotions, Watson (1924) cited 
'restraint of movement' as the principal cause of rage. We can 
see in physical restraint another way of interrupting a chain, 
and some credence may therefore be given to Watson's posi- 
tion. But it should be noted that the restraining action must 
be imposed upon movement-in-progress it must be truly 
'thwarting.' Dennis (1940) has pointed out that a new-born 
infant's random movements may be restricted greatly and for 
long periods of time, as by swaddling clothes or cradle-boards, 
without noticeable emotional upset unless the 'restraint' is 
forceful enough to constitute a negatively reinforcing 
stimulation. 

The operation of chain-breaking may be detected or sug- 
gested in many everyday situations allegedly productive of 
anger. In a study by Gates (1926), women students were asked 
to report all instances of this emotion during a one-week 
period of self-observation, giving in each case an account of 
the precipitating circumstances. Among the situations men- 
tioned by these subjects were scoldings, criticisms, refusals of 
requests, tardiness of friends in keeping appointments, get- 
ting a wrong number on the telephone, failure in operation 
of such objects as watches, fountain-pens, and typewriters, 
delays in bus or elevator operation, clumsiness in dressing or 
sewing, disobedience of dogs or children, interrupted sleep, 
and loss of money. These occasions were reported to engender 
"impulses" to (a) make verbal retorts, (b) do physical injury 
to the offender, (c) injure inanimate objects, (d) get out of the 
situation, and (e) cry, scream, or swear. The order of fre- 
quency with which these tendencies were reported is indicated 
by the order of mention. The verbal retort was specified five 
times as often as crying, screaming, or swearing. Also, persons, 
rather than things, were reported as the main source of anger. 
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Obviously no one pattern of responses for all subjects was 
peculiar to any one situation, since the responses involved 
were dependent upon the individual's history of conditioning, 
yet some similarity of outcome is indicated by the fact that 
the responses fall into these five groupings. 

In spite of the fact that no two persons react in exactly the 
same way as a result of the anger-producing operation, there 
appear to be certain fairly common elements of such behavior 
as we ordinarily observe it. Increased muscular tension (some- 
times seen in the clenched fist, the grimly set jaw, and the 
rigid posture of an angry man), movements of advance or 
attack, stamping of the feet, elevation of the voice, and so on, 
are often enough seen to suggest the state even when we are 
unaware of the initiating conditions. 

3. Sorrow and joy. The changes in rat behavior resulting 
from stimuli that have been regularly associated with electric 
shock often lead observers to attribute fear to these animals. 
The sudden burst of bar-pressing and biting responses at the 
beginning of an extinction session is frequently said to show 
anger. We now see that the use of these two terms is justified 
in some degree. Sorrow and joy, on the other hand, are seldom 
imputed to the rat in any situation, even by persons who reg- 
ularly employ such terms in connection with the behavior of 
dogs, chimpanzees, and human beings. Are there operations, 
different from those we use to define anger and fear, which 
are related to special changes in reflex strength? 

We are loath to reply to this question, for several reasons. 
Psychological literature does not abound with treatment of 
these concepts. One of the most authoritative discussions in 
recent years (Young, 1943) gives them barely a mention; and 
most textbooks do little more than assume their existence* 
Unlike anger, and especially fear, they have seldom been the 
subject of scientific investigation. Moreover, those who at- 
tempt to give meaning to these terms are in no great agree- 
ment, possibly because they have so often been concerned 
with the fruitless task of identifying these emotions by facial 
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expression. Yet, it is improbable that the present reader will 
have the least doubt about the existence of sorrow and joy. 

It would perhaps be wise to say nothing more about the 
matter here. Yet we cannot resist mention of two possibilities 
that present themselves in connection with familiar descrip- 
tions of sorrow and joy. To us it seems clear that the complete 
removal of secondary positive reinforcers occasions a wide- 
spread behavioral depression which involves changes com- 
monly referred to as those of grief, sadness, or sorrow. The 
irretrievable loss of money, home, possessions, family, or 
friends seems to bring an obvious altering of muscular tonus 
(one is 'bowed down* under the 'weight of sorrow'), a slowness 
or absence of reaction to customary social stimuli, and a 
decrease in the frequency and force of speaking, not to men- 
tion the appearance of sobbing and moaning. 

The withdrawal of strong secondary reinforcers implies the 
breaking of response chains, and this is an anger-producing 
operation. How, then, can we call it a condition of sorrow? 
The answer here may be that anger is the first stage in the 
production of sorrow. If the loss of secondary reinforcement 
is impermanent or restricted to but a few chains of action, 
anger may be the result, but if the loss is prolonged and re- 
lated to many chains sorrow will ensue. Accounts of sorrowful 
behavior seem to be in part accounts of anger. Thus, Foley 
(1935) has described the grief of chimpanzees as due to such 
factors as a refusal or delay in complying with their expressed 
wants or desires, operations which Hebb (page 342) treats as 
productive of anger. And have we not all observed that great 
losses are commonly met at the outset by cries of rage and 
resentment, to be followed soon by the deepest of depressions? 
Perhaps, even, the 'sulkiness* so often taken to be a form of 
anger is, in good part, the beginning of sorrow. 

If the removal of secondary positively reinforcing stimuli 
is important for sorrow, certainly the presentation of such 
stimuli should facilitate those responses we call joyful. Again, 
we have little evidence to present, but the non-experimental 
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observation of human beings and certain animals points con- 
vincingly in this direction. Almost everyone has noted the 
changes in the behavior of children at the approach of a dot- 
ing relative or family friend, the announcement of an un- 
expected holiday, the promise of a trip to the beach or the 
zoo, or the sound of the bell that signals a recess from school. 
The reaction picture on such occasions is one of excitement, 
just as marked in its own way as the depression of sorrow. 
Lively, free, and extensive movements of arms and legs (the 
child 'dances for joy'), smiling, laughing, excessive vocaliz- 
ingthese and other activities are commonly observed when 
a child is presented with verbal or other stimuli that have in 
the past been followed by strong positive reinforcement. 
Adults are usually less expressive, and react in a less diffuse, 
more differentiated fashion than do youngsters, but some 
changes in tendency to respond are usually apparent. 

These suggestions are put down with hesitation, because 
the problem of identifying specific emotions is a complex one. 
We have not made an exhaustive attack upon this problem, 
but an important aim will have been accomplished if we have 
shown the way in which an objective, experimental approach 
must proceed. One thing is certain: there are operations, 
involving stimulus presentation and removal, which do more 
than change the strength of a single response (as in condition- 
ing and extinction); they change the strength of many re- 
sponses. We have argued that these latter changes, taken in 
conjunction with the operations involved, seem to be related 
to traditional descriptions of fear, anger, sorrow, and joy. But, 
even if this were not entirely true, we would still have the 
problem of accounting for the changes resulting from the 
operations. 

Emotional Conditioning 

It is unnecessary to say much about the conditioning of 
emotions, in view of the instances of it which have already 
accumulated in this text. It has long been recognized that 
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emotion involves a great deal of autonomically governed be- 
havior; and, since Chapter 2, you have known that this kind 
of behavior is readily conditioned in Type S fashion. Our 
treatment of the effects of punishment, in Chapter 4, made 
explicit note of emotional conditioning; and statements of 
similar import have occurred in our discussion of several 
other topics. Emotional conditioning is a fact, and so is emo- 
tional extinction, although our knowledge of these processes 
is still at a fairly primitive level. 

A classic instance of conditioned emotion may be cited 
here. Watson and Rayner (1920) presented a nine-months-old 
boy with a white rat at the same time that a loud sound was 
produced by hammering on a steel bar. The rat had not pre- 
viously disturbed the child, whereas the noise had elicited a 
violent 'fear reaction/ Only a few combinations of the two 
stimuli were required to give the rat a strong conditioned- 
stimulus function. Moreover, the effect was shown to gen- 
eralize to other furry animals and several inanimate objects 
of a 4 furry' nature even to cotton wool. 

Criticism was later launched at Watson's definition of fear 
(see page 338), but several studies have confirmed the funda- 
mental observations of a conditioned emotional change, and 
a few investigators have gone further. Jones (19243, ig24b), 
working with a three-year-old boy who came to her with an 
already well-established emotional reaction to rabbits, rats, 
and other animals, undertook to eliminate this effect. The 
procedure first employed was that of introducing the rabbit 
to the the child gradually while the latter was playing with 
three other, unafraid, children. This technique was working 
well when an unfortunate experience with a large dog served 
to recondition the emotion. The method then adopted was 
that of presenting the rabbit at times when the child was 
eating a highly desired food. In order to avoid a violent upset 
on any test occasion, the animal was never brought so near to 
the child as to depress his eating behavior. This procedure of 
combining the extinction of some responses with the positive 
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reinforcement of others was successful in eradicating the fear. 
In addition, Jones found that the extinction generalized to 
such a degree that the other, formerly frightening, animals 
and objects lost their power to evoke the emotion. 

In view of such findings as these, you may be surprised to 
learn that very few follow-up studies of the conditioning and 
extinction of human emotions have been made. Obviously, 
much work remains to be done in this area. It is understand- 
able, however, that experimentalists (and parents) hesitate to 
subject children to emotional conditioning unless there is a 
control of factors, in the home and elsewhere, such that the 
often lengthy process of extinction can be carried to comple- 
tion. The usefulness of preliminary research with animals is 
clearly indicated. We do well, perhaps, in following the lead 
of medical scientists and others who test out their procedures 
at a phylogenetic level where it will not endanger human 
health and development. 

Observing Our Own Emotions 

It was stated earlier (page 340) that our 'consciousness' of 
our own fear, joy, and so forth was not a satisfactory reason 
for assuming the reality of these states of reflex strength. 
Before leaving the topic of emotion, something further should 
be said about this statement. 

When we say that we observe anything, we mean no more 
than that we make a specific response to a discriminative 
stimulus or stimulus compound. Such stimulation may, in 
psychology, arise from the behavior of other organisms, in 
which case we ordinarily speak of 'objective' observation. It 
may also arise from our own responses, affecting us through 
our eyes, ears, or the sense-organs in our muscles (propriocep- 
tors). When you say, of another person, "He is angry," you are 
behaving in much the same way as when you respond to your 
own behavior by saying "I am angry." The difference lies 
mainly in the discriminative stimuli. Moreover, your re- 
sponses to either of these two stimulus situations are made 
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because they have in the past been reinforced in their pres- 
ence you have actually been taught to discriminate by some- 
one in the course of your behavioral development. 

If this is clear, let's go a bit further. When you were taught 
to say "I am angry," your teacher could not himself observe 
(respond to) all the stimuli present, because some of them 
were produced by your covert responding. He could observe 
the external stimulus situation and some of the changes in 
your behavior that followed, but not all. Which stimuli be- 
came important for you as S D 's he could not tell. Your teacher 
may have said only "You're angry, aren't you," when he saw 
your overt behavior. When you come to say "I am angry," 
your S D 's may not be limited to those which led to his re- 
sponse. The astonishing fact is that, after such training, there 
is so often good agreement between one's own report of anger 
(or some other state) and the report of an onlooker not sel- 
dom is the assertion "You look angry" answered by the asser- 
tion "I am angry." 

We have already said, however, that the response "I am 
angry," as well as such typical non-verbal accompaniments as 
fist-clenching, may be made independently of any anger-pro- 
ducing operation whatever. The use of these words and 
gestures may be effective in producing other kinds of rein- 
forcement than that provided by our 'teacher.' They may be 
used, for example, because they have been reinforced by flight, 
or some form of compliance, in another. In such cases it is 
sometimes hard for us to say whether a person is 'really angry,' 
'just bluffing,' or 'hunting for sympathy' unless we know his 
history very well and the situation in which he is behaving. 

By now you should be able to see why psychologists have 
hesitated to lean upon verbal reports of emotion. These re- 
ports are discriminative responses as much as any other, but 
some of the S D 's are private and one cannot be sure that they 
always go hand in hand with those that are public. In any 
case, you should appreciate that one's reports of his own emo^ 
tions are derived initially from the discriminative training 
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given by someone else. Hence, they can hardly be used as the 
test of any objective distinction between anger, joy, and any 
other states. 

Emotion and Feeling 

In Chapter 8 (pages 254-256), we noted that the problem of 
reported 'feelings' is similar to that of reported 'emotions/ 
When someone tells us that an object, a color, or a design is 
'pleasant* or 'unpleasant/ he is reporting upon his own reac- 
tions to that object, color, or design. A positive, or a negative, 
reinforcer has been presented or withdrawn; a change in be- 
havior has taken place, including, perhaps, incipient move- 
ments of approach or withdrawal; these movements (or others) 
provide the S D 's for his verbal responses his 'affective judg- 
ments/ The situation is undoubtedly similar to that which 
obtains when we ask subjects to identify their emotions. We 
can easily understand why feelings and emotions are charac- 
teristically lumped together, and why people have often tried 
to classify emotions as 'pleasant' or 'unpleasant/ The opera- 
tions of giving and taking away reinforcement are important 
in both emotion and feeling; whether we report one or the 
other depends upon the particular S D complex (as well as the 
request for a report) that is operative at the moment. 

Finally, we can see why feelings and emotions are generally 
treated as 'subjective/ We might suggest that you extend this 
line of thought a little farther. Where, in essence, would 
"beauty" and "ugliness" lie? And what is the basis for our 
ethical judgments of "good" and "bad"? These are interesting 
questions, in psychology as in other disciplines, but this is 
hardly the place for their discussion. We have perhaps 
equally interesting matters for consideration in the chapter to 
come. Let us, then, move on. 
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NOTES 

An excellent review of past treatments of emotion by psychologists and 
physiologists may be found in Chapters XI and XII of Woodworth's 
Experimental psychology (1938); and you may find interesting the 
related chapters (X and XIII) dealing with feeling and the galvanic 
skin response. It will also pay you to read the discussions of emotional 
'expression' (Landis) and the physiological theory of emotion (Bard) 
in the Handbook of general experimental psychology, edited by Carl 
Murchison. For an account of the James-Lange theory of emotion, 
you ought to go to that written by James himself in his Principles of 
psychology f Vol. II, Chapter 25 (1890). The most inclusive modern text 
on emotion is Young's Emotion in man and animal (1943). The intrinsic 
interest and readability of these accounts should not blind you to the 
fact that they do not attempt a systematic behavioral analysis of the 
problemone that is free of mentalism and physiology. 
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SOCIAL BEHAVIOR 

SOCIAL BEHAVIOR comprises the stimulations and reactions arising be- 
tween an individual . . . and his fellows. . . . Social psychology must 
not be placed in contradistinction to the psychology of the individual; it 
is a part of the psychology of the individual, whose behavior it studies 
in relation to that sector of his environment comprised by his fellows. . . . 

F. H. Allport, Social Psychology, 1924 

Introduction 

In one sense, this chapter is anti-climactic; in another, it is 
preparatory. Your introduction to the problems of general 
psychology was essentially completed with the discussion of 
motivation and emotion. The present chapter represents in a 
small way how our basic principles may be extended into 
areas of behavior which, though complex, are compelling in 
interest and importance. Psychologists have painstakingly 
amassed a large body of information about the social conduct 
of animals and human beings. It will be our task here to show 
how this area may be approached with the analytical tools now 
at your command. Only considerations of space prevent our 
doing the same thing for other special areas, such as abnormal 
psychology, developmental psychology, and so on. 

We have said before (page 257) that social stimuli do not 
differ from other stimuli in their dimensions. Rather, the 
difference is one of origin. They arise from other organisms, 
their behavior, or the products of their behavior. Moreover, 
social stimuli do not differ in their function from those of 
inanimate origin; they act as eliciting, reinforcing, discrim- 
inative, and so on. Social life arises because social stimuli come 
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to exercise these functions. These facts give us a solid step- 
ping-off point for the discussion to come. It will help, too, to 
recall that sub-human organisms, no less than man, exhibit 
social behavior. Indeed, from them we may learn a number of 
things that will enable us to see man's social activities in a 
more objective light. 

In venturing to extend our principles into social behavior, 
we must be cautious because the experimental or field data 
under examination have often been gathered under complex 
conditions. Complex studies, in which many (and sometimes 
unknown) variables are at work, allow only general inter- 
pretations and force any detailed analysis to be too speculative 
for scientific comfort. The details of data may resist analysis 
even when we discern in them the operation of some underly- 
ing principle that we know well. 

Social Behavior in Animals 

Behavioral interaction may be observed at the biological 
level of single-celled creatures. The plant Volvox, for ex- 
ample, lives in a colony to which the offspring remain con- 
nected until they break away to form their own aggregations; 
and Paramecia, which are normally solitary animals, come 
together for the purposes of sexual reproduction. At this level, 
actions probably depend on physico-chemical changes that 
function exclusively in an eliciting fashion. 

Higher in the animal scale, insects have long provided us 
with complicated examples of social interaction. Everyone 
has heard of the division of labor and specialization of func- 
tion which characterize ant and bee communities. Some 
species of ants even protect and rear other organisms (aphids) 
within their nests, and 'milk 1 them later for the nutritive 
liquid they secrete. Much of this and other insect behavior is 
probably elicited respondent-wise. Nevertheless, insects can 
also learn in Type R fashion. We know that they may be 
taught to run simple mazes for food and to escape from 
noxious stimuli; and it may be that social stimuli, such as 
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those involved in return to the nest, may also act as operant 
reinforcers. 

Birds provide further examples of socially mediated rein- 
forcement. Many species habitually live in flocks, but even 
where this is not the case, they may form groups under certain 
conditions, as at times of migration. Seasonal changes in tem- 
perature and in diurnal illumination produce bodily effects 
(e.g., in the sex organs) that raise the activity level. The 
gathering of a migratory group at given locations, the forma- 
tion of typical flight patterns in the air, the flight leadership 
by older and more experienced birdsall of these probably 
involve operant-respondent overlap, but social interaction is 
obvious. Most birds show a degree of care for the young, and 
they may prolong this care beyond the time when the off- 
spring are capable of independent flight and self-support. 
Even the songs and call-notes of birds may be in part a prod- 
uct of social control. In the oft-cited experiment by Scott 
(1901), young orioles were separated from their parents and 
raised in isolation, so that they would not be exposed to the 
normal oriole song, with the result that a new kind of song 
appeared. Conradi (1905), later, placed sparrows with canaries 
and observed that the sparrows tended to reproduce the 
canary calls. In such creatures, there may be a natural ten- 
dency to echo sounds, but even if there were not, it is possible 
that, through socially-given differentiation training, such 
behavior might arise, with the parent bird providing food 
for the young when certain sounds were approximated. In 
this way, a young stranger in the nest might gradually be 
taught the 'language' of his foster parents, even as we do 
with our own children. 

Social behavior is, of course, found in extraordinary rich- 
ness among mammals, particularly the primates. Habitual 
living may be in herds, flocks, and packs; family units are 
often clearly defined; and sexual attachments may be mo- 
nogamous or polygamous, seasonal or perennial, and some- 
times even homosexual. One has but to think of domesticated 
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animals like the dog and horse to realize that social stimuli 
may become powerfully discriminative, reinforcing, and 
drive-arousing for sub-human behavior. Monkeys, baboons, 
and the anthropoid apes have been studied in some detail by 
psychologists and zoologists, and much is known about their 
actual or potential social activity. Investigators all attest to 
the strength of friendships and enmities among these animals, 
and to various other well-delineated inter-personal relations. 

The exigencies under which social stimulation controls 
animal behavior are numerous. Sexual needs give reinforcing 
value to the mate who mediates the reinforcement. Parental 
care, especially among animals in which the young go through 
a long period of dependency, also favors the development of 
social S D 's and S r 's. Huddling behavior, whereby physical 
warmth is provided, may lead to the same end. Even food- 
getting may give rise to behavioral control by such stimuli, 
as when predatory animals hunt in packs and work together 
in running down their prey. 

In a famous experiment by the Kelloggs (1933), a seven- 
months-old chimpanzee, Gua, was brought into their family 
and raised with a ten-months-old child of their own, Donald, 
during a period of nine months. Every attempt was made, 
during this period, to treat the youngsters in identical fashion. 
They lived and played together like any ordinary pair of 
children, even as twins might; they wore the same clothes, 
followed the same daily schedules, ate the same foods, slept in 
similar beds, were spoken to in the same way, and so on. The 
'children' accepted each other fully and affectionately; and 
Gua's development testified clearly to the potentiality of 
chimpanzee behavior. She was hardly to be outdone by 
Donald in a number of activities. She acquired, for example, 
a good 'passive' vocabulary, reacting consistently and correctly 
to many words used by the Kelloggs. In other respects, she 
definitely surpassed Donald, as in her climbing ability. The 
experiment could not, of course, be continued indefinitely, 
since the child had clearly less to gain from the association; 
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but it provides us with a clear indication of almost-human 
social capacity at the sub-human level. 

Sample Analyses of Social Interaction 

Of the several types of social relation among animals that 
have received the attention of psychologists, we may select as 
examples for our closer attention those of dominance, co- 
operation, and imitation. 

i. Dominance. An animal may achieve dominant status 
over another by virtue of the actual or threatened negative 
reinforcement it applies. In competition for food, mates, 
sleeping quarters, or whatever, the dominant individual wins 
out by lowering response strength in a rival through punish- 
ment. He may thus become a strong secondary negative rein- 
forcer and S D , to the extent that a mere gesture will lead the 
submissive animal to beat a hasty retreat. Schjelderup-Ebbe 
(1935) has noted that dominance may often be established at 
the first meeting of two birds, with the dominant animal 
driving away or pecking the other at will. The avoidance and 
escape reactions of submissive birds may persist even when the 
dominant birds have become old and weak, if only the battle 
posturings of the latter maintain their S D and S r functions. 
The fact that submissive animals sometimes 'rebel' is con- 
sistent with our knowledge of the impermanence of the effect 
of punishment; and we may suspect that rebellions are more 
likely to occur when motivation is high and the dominant 
animal blocks the way to reinforcement like food or a mate. 

Dominance through physical force has also been observed 
among baboons by Maslow (i936a, igsGb) and among rats by 
Seward (1946), and has been seen to involve stimulus general- 
ization. A badly-thrashed animal will transfer his submissive- 
ness to other animals than the one that first defeated him. A 
single thrashing, if severe enough, will bring about submis- 
sion to all comers, even when the loser in a fight has pre- 
viously been the champion. 

Dominance-submission relations, it is often said, may be 
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based upon other factors than general physical strength. This 
is true, since the application of punishment by one individual 
is not related solely to over-all physique. The determining 
factors are the histories of the interacting organisms. Thus, 
an old bird, or a relatively feeble young one, may be dominant 
by virtue of the submissive one's experience with similar 
war-like stances assumed by a former victor; and a nimble or 
resourceful ape may dominate a stronger one because he is 
nevertheless more adept in applying punishment. Circum- 
stances like these are not far different from those of human 
experience. However, there is no reason why, in human as 
well as in animal life, dominance-submission need not be 
based on a combination of positive and negative reinforce- 
ment, or on positive reinforcement alone. Any definition of 
dominance is arbitrary. In some animal species, for example, 
a normally submissive female may gain marked control, in 
mating season or at other times, because of sexual changes in 
the male which raise her reinforcement value for him. Or, if 
a person's own efforts often result in failure and there is 
someone who unfailingly supplies rewards, independent 
efforts may cease and excessive dependence on the willing 
benefactor may develop. Furthermore, the dominance of one 
individual and the submission of another may be specific to 
certain situations and actions, in accordance with the histories 
of both individuals. It is not always the same man in a college 
class who dominates in both the classroom and the gym. 

2. Cooperation. Cooperation, like dominance, is not a 
single response entity, but denotes many kinds of responding 
in many types of situation. It may be defined as the case in 
which the combined behavior of two or more organisms is 
needed to procure positive, or remove negative, reinforce- 
ment for either. Two organisms may perform the same act, 
like hauling together on a rope to obtain food, or different 
acts, as when one steps on a treadle to open a door while the 
other pulls in food. This sort of cooperation is easily obtained 
in human adults, by verbally instructing them in their roles 
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or by letting them discover the efficiency of joint action. With 
animals, as well as young children, we have to train them 
from the start and can, therefore, observe how cooperation 
develops. The effectiveness of verbal instruction in the human 
acfult depends, of course, upon a long history of training, from 
childhood on; and, even with us, cooperative endeavor in 
some activities is often achieved only with difficulty some- 
times never. In any case, whether with human beings or ani- 
mals, and whatever the task, cooperation involves two things: 
(i) each organism's action must be discriminative for the 
other's performance; and (2) each organism must be rein- 
forced for the part it plays in the cooperative scheme. 

The investigations of Daniel (1942, 1943) are good illustra- 
tions of the way in which cooperation is acquired by animals 
difficult to train in such behavior. In his studies, rats were 
first taught to obtain food from a dish on the floor of their 
experimental chamber, and then to turn off an electric shock 
from the grid floor by climbing on to a small platform. The 
platform was eight inches from the food dish, making it im- 
possible for the animal to eat and avoid shock simultaneously. 
When the two responses had been independently well estab- 
lished in each rat, training in cooperation began. Two hungry 
animals were placed in the chamber at the same time, with 
the floor electrified and food in the dish. Thus, one rat was 
able to eat without being shocked only if his partner stayed 
on the platform. If both went to the food dish, both were 
shocked. Experimentation took place during a forty-day pe- 
riod. As training progressed, Daniel observed that the rats 
came to exchange positions with considerable regularity, and 
with such efficiency that both got plenty to eat and were able 
to avoid shock on 94-99 per cent of all the times that either 
animal left his position. Seldom did they waste feeding time 
by sitting on the platform together; and more and more their 
behavior came to be directed by the stimulation they received 
from each other. Keeping at least one foot on the platform, 
the shock-avoiding rat would nudge, paw, or even bite the 
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tail of the feeding rat until the latter left the dish and gave 
him his 'turn* at eating. 

A sketchy analysis of such cooperative behavior might run 
as follows. The platform rat, though hungry, stayed in place 
because shock was thereby avoided. As his anxiety abated, his 




FIG. 75. Plan of the apparatus used by Daniel (1942) in studying 
cooperative behavior in rats. (After Dashiell, 1949.) 

food-getting behavior became prepotent and he attempted to 
produce the S D for approach to food to bring the other rat 
beside him on the platform. His nudgings, pawings, and bit- 
ings served in turn as anxiety-arousing stimuli for his partner, 
since they had in the past been followed by electric shock if 
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he delayed in returning to the platform rat's side. The only 
escape from this stimulation for the feeding rat was, of course, 
by leaving the food and going to the platform; whereupon the 
platform rat, having its S D for 'safety/ was free to eat, and the 
entire process was repeated with the animals in reversed posi- 
tions. All that was required to maintain the subsequent cycles 
of exchange to keep the controlling stimuli effective was an 
occasional shock to both rats resulting from a failure of one or 
the other to 'play his part/ 

In the above experiment, cooperation was based primarily 
upon hunger and shock-aversion. In his second study (1943), 
Daniel tried to see what would happen if, after coopera- 
tive behavior of the above kind was established, the shock 
was systematically reduced until none was ever used. In this 
experiment, an electrically operated cap was placed over 
the food dish so that an animal had to be on the platform 
in order for the cap to be lifted and the food exposed for 
eating. The question was whether the animals would con- 
tinue to 'take turns' at platform and dish under these 
conditions. They did not, and it is not difficult to see why. 
The anxiety that caused one rat to return to the platform 
on being nudged by the other is no longer present. The 
only motive at work now is hunger, and the only reward for 
going to the platform and away from the dish is the chance to 
eat later if the partner cooperates in the same way. The rein- 
forcement for going to the platform is, at best, long delayed, 
and stimuli like nudgings merely signal to the eating animal 
that food may soon be withdrawn. This kind of "higher 
order" cooperation, involving long delay of reinforcement, is 
apparently very difficult, if not impossible, to obtain in a rat, 
whereas it is characteristic of many everyday human activities 
such as budgeting one's income in order to save for desired 
things like a car or the family's own home. Nevertheless, you 
can see that cooperative behavior at all levels of organismic 
life follows our basic principles, and that the functions of 
social stimuli are the same as those of non-social. 



SOCIAL BEHAVIOR 361 

3. Imitation. Our third illustration of social relations 
among animals is that of imitation, and it, too, can give us an 
opening into human social behavior. We need not go into 
the centuries-long discussion of imitation. It is enough to say 
that, for a long time, many outstanding men like Charles 
Darwin and William James believed that imitation was 
simply a natural or "instinctive" thing. Early social psycholo- 
gists like Gabriel Tarde (1890) regarded imitation as a funda- 
mental trait of behavior, and used it as an explanatory prin- 
ciple in itself when discussing the regulation of human custom 
and the development of human society. Experimental at- 
tempts to measure "unlearned" imitation in man or animals 
never gave an unequivocal answer, and there has been an 
increasing reluctance on the part of psychologists to appeal to 
imitation as the explanation of any case of one animal's 
duplicating the behavior of another. They feel that be- 
havioral duplication can arise from many causes, so that not 
all cases ought to be classed together. Moreover, not all be- 
havioral duplication is "imitation," since there are many 
duplications no one would be inclined to class as imita- 
tion; for example, people walking to the morning bus, or a 
defensive fullback chasing the ball carrier. 

Whether or not there is any such thing as unlearned or in- 
nate imitation, one thing is sure. Imitation can be taught. 
Through the use of conditioning procedures by making rein- 
forcement contingent upon repeating another's actan organ- 
ism can be made to imitate. An act of one organism is made 
the S D for an act of another, so that the action of the leader 
must be repeated if the follower is to get his reinforcement. 

As an example of this leader-follower relation, we may cite 
an experiment by Miller and Bollard (1941) who worked with 
both human and animal subjects. Once more, we choose an 
animal illustration because of its relative simplicity and 
dramatic clarity. The animal (a rat) who was to act as leader, 
was first and separately taught to run down the stem of an 
elevated T-maze, and at the choice point to turn down the 
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arm marked by a black card. In the usual discrimination 
training procedure, the position of the white and black cards 




FIG. 76. The apparatus used in the Miller-Dollard studies of imita- 
tion. The stem of the T is 18 inches long and the running surface is \Ys 
inches wide. The gap to be jumped at the choice point is 434 inches 
in either direction. The food-cup for the leader is 161/3 inches from the 
gap; for the follower, 8i/ inches and is covered by a hinged cap which 
is removed after the leader has passed over it. (After Miller and Dollard, 



was exchanged in random order from trial to trial on the arms 
of the T, but a turn toward the black card (S D ) always brought 
the animal to food, whereas a turn to the white card (S A ) went 
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unreinforced. After the leader had mastered this discrimina- 
tion well, a potential follower (a rat without any previous 
training, but hungry) was placed directly behind him on the 
stem of the T. The follower received reward only if he turned 
into the same arm of the T as did the leader; if he turned into 
the opposite arm, he got no food. The experimental results 
left no doubt that imitative responding was learned: the re- 
sponse of the leader became an S D for the follower who came 
practically always to make the same turn he did. This finding 
was not dependent on accidental cues from the black and 
white stimuli reaching the follower, as proved by the produc- 
tion of imitative responding without cards being present at 
all. For this check, some leaders were trained always to turn 
right in the T, others always to turn left. The tested animals 
were well-trained followers, but had learned their following 
with leaders who had themselves obeyed card cues. With the 
new leaders who required no cards, and who sometimes were 
right-turning animals and sometimes left-turning animals, 
the followers continued to use the leader's response as S D . 
You will not have missed the added implication of this finding 
that leaders will generalize for followers, so that a well-trained 
follower will trail after many leaders indiscriminately! In the 
light of recent and tragic historical examples we would hardly 
care to train into our children such discriminative depend- 
ence upon others or the generalized tendency to follow any 
would-be leader who happens along. 

Human Behavior and the Cultural Environment 

The community in which men and women live makes up 
an important part of their total environment. It is difficult, if 
not impossible, to think of a single operant activity of the 
individual which does not show in some degree the pervasive 
influence of his community's teaching. The full recognition 
of this we owe to the efforts of anthropologists and social ps> 
chologists who have devoted themselves to the study of the 
form and content of human societies around the world, and 
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to the interaction between the individual and his cultural 
environment. From earliest times, men knew, and were inter- 
ested in, the fact that neighboring or distant communities 
existed that were different from their own, and that people 
behaved differently in those communities. During the nine- 
teenth and twentieth centuries, anthropology developed as a 
social science, an important sector of which came to be known 
as comparative ethnology, or the comparative study of human 
behavior and customs in different societies. From such study, 
we have been led to discard some old conceptions and atti- 
tudes towards human communities. 

The new conception may be called cultural relativity. 
Where once we were inclined to think of other peoples as 
strange, backward, and inferior to ourselves, the conclusions 
of anthropologists lead us to think otherwise. They point out 
that even the so-called primitives think the same things about 
us. In some societies, even the word human is limited to one's 
own group, while other peoples are described as "non-men" 
or "sub-human." Furthermore, the primitive is not, as was 
once thought, childlike in his simplicity and illogical in his 
reasoning; in fact, he thinks the same things of us and easily 
picks out weaknesses in our culture and inconsistencies in our 
social behavior. Nor does he concede any ethical superiority 
to us. There are instances, for example, where he has re- 
proached us for our wholesale slaughters in war and has 
offered to send missionaries to teach us how conflicts among 
men may be settled without bloodshed. He is amused by some 
of our customs, and revolted by others. Sometimes he is im- 
pressed by our gadgets and material achievements; or he may 
be singularly unmoved by tall buildings and ships, or air- 
planes, pointing out that such things, while perhaps useful, 
are no guarantee of a peaceful and abundant span of years. 
All in all, the comparative study of cultures leads us to con- 
clude that such differences as exist among human groups do 
not form a basis for making value judgments concerning 
them. Differences in customs, folk-ways, manners, religious 
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practices, ideals, beliefs, and so on, are simply differences in 
the way men may grow up in diverse physical environments, 
out of diverse historical streams, and out of a total potential 
repertory of behavior that is far greater than any one culture 
can explore to the full. Each human society is to be ap- 
proached in a spirit of objectivity and accorded the tolerance 
(which we would wish for ourselves) that comes from an un- 
derstanding of its origins, values, and methods of molding 
the lives of its members. 

Understandably enough, this contribution of anthropology 
has, in its dissemination, sometimes been misconstrued. The 
underlying conception of man's alikeness must not be lost in 
the recognition of differences. What men learn in different 
societies ought not to obscure the basic facts that all men learn 
in the same way and are subject to the same drives. The prin- 
ciples of reinforcement, extinction, discrimination, and the 
like, operate universally, though the form of the response to 
be learned, or the special type of S D to be obeyed, may be 
selected by the community. Similarly with the motives of 
men: all men eat, though they differ in how and what; all 
men drink, perform sexual acts, and breathe. Further, all men 
have similar eyes and ears and noses for the reception of 
exteroceptive stimuli, though they may be taught to respond 
differently to those stimuli, through conditioning procedures 
that are the same for everyone. We need to hold onto the idea 
of human absolutes as much as to that of cultural relativity. 

The cultural environment (or, more exactly, the members 
of the community) starts out with a human infant formed and 
endowed along species lines, but capable of behavioral train- 
ing in many directions. From this raw material, the culture 
proceeds to make, in so far as it can, a product acceptable to 
itself. It does this by training: by reinforcing the behavior it 
desires and extinguishing others; by making some natural and 
social stimuli into S D 's, and ignoring others; by differentiating 
out this or that specific response or chain of responses, such as 
manners and attitudes; by conditioning emotional and 



366 PRINCIPLES OF PSYCHOLOGY 

anxiety reactions to some stimuli and not others. It teaches 
the individual what he may and may not do, giving him norms 
and ranges of social behavior that are permissive or prescrip- 
tive or prohibitive. It teaches him the language he is to speak; 
it gives him his standards of beauty and art, of good and bad 
conduct; it sets before him a picture of the ideal personality 
that he is to imitate and strive to be. In all this, the funda- 
mental laws of behavior are to be found. 

Personality and History 

While all human beings obey the same laws of behavior, 
each individual ends up with a unique behavioral equipment 
that defines his 'personality/ The emergence of uniqueness 
from uniformity is possible because of the variations in cir- 
cumstance under which the basic processes are worked out. 
The uniqueness stems from the reinforcement and motiva- 
tional history of the individual which is different from that 
of others. We may think of the individual as a special (and 
very large) set of parametric values of basic functions. Thus, 
persistence in the face of failure may vary greatly from one 
person to another, depending upon the amount and variety 
of periodic reinforcement one has had in the past. Again, one 
may be more or less dominant or submissive in the presence 
of friends and strangers, depending upon his experience with 
other people. 

The fact that the individual's present personality is related 
to his biography leads to an interest in behavioral develop- 
ment from birth through infancy, childhood, adolescence, 
adulthood, and old age. Students of psychopathic behavior 
know the importance of the early years in forming habits that 
may last for life. In tracing behavioral disorders of adults they 
are often led back to childhood experiences as the sources of 
instability and disturbance. The importance of biography has 
long been known by politicians, fanatics, and all manner of 
special-interest groups, each of whom, in seeking to manage 
society, sought either to control the family in which the child 
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grows up, or to gain outright possession of the child. The 
tenacity of early training has been made the topic of bitter 
wit, as when William James said, "People often think they are 
thinking, when they are merely re-arranging prejudices"; or, 
as a student once wrote in a term essay, "A college education 
does not eradicate prejudices, it merely makes them more 
subtle/' He who would control society must ultimately reach 
the youth, and there is psychological soundness in the old 
slogan, "Give me the child until seven, and you can do with 
him as you wish thereafter." We need not agree with the com- 
plete pessimism of this opinion, nor vouch for the age seven, 
but the idea is not without basis. 

Consider the "Oedipus complex" of psychoanalysis. The 
general idea, as the reader probably knows, is that the son 
tends to rebel against his father, and wishes to overcome or 
destroy him, because the father is a powerful and pre-empting 
rival for the affection of the mother whom the son covets. This 
competition with, and fear of, the father colors the whole 
psychological development of the child, and may be reflected 
in many ways in the adult character. The Oedipus relation 
was thought to be an innate, motivating reaction of the son 
to the father. This view of the father-son relation seems, how- 
ever, to arise within families in which the father is the dis- 
ciplinarian who applies negative reinforcement. Study of 
other kinds of family organization, and in some non-European 
cultures, shows that the father-son relation can be quite differ- 
ent (Malinowski 1929, Kardiner 1939, Linton 1945). In some 
societies, the father has little or no disciplinary jurisdiction 
over the child, and acts mostly as a beneficent supplier of com- 
forts and affection. It may be the mother's brother who acts 
as head of the family and administers discipline. In this case, 
we would have an "uncle-Oedipus complex." There is little 
reason to resist the conclusion that the personality develop- 
ment of the individual is a function of his own conditioning 
history. 

After the time when the role of the family is at its peak, 
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formative influences continue to work upon the maturing 
child through his friends and his school. The currents and 
counter-currents of conditioning and extinction that stem 
from social origins do not stop. Joined with the reinforcement 
history of an individual are such factors as his general health, 
physique, personal appearance, social and economic position. 
Other things equal, a strong and healthy child is more likely 
than a weak and sickly one to develop physical self-reliance 
through the success of his running arid jumping and climbing 
and pushing. In our society, where 'good looks' are highly 
prized, a handsome child can quickly acquire social suavitv 
and self-assurance (or a 'spoiled brat' character) because of the 
partiality and indulgence shown by his elders. Persons of high 
socio-economic status are often the victims of obeisance and 
flattery and what is flattery but the indiscriminate use of 
positive reinforcement? Obstinacy and arrogance may be the 
result. The 'cute* child who is encouraged again and again to 
recite, to tell jokes, or dance, may continue to do so as long as 
he wins approval, until he ends up making his living at it. 
With such examples, we do not, of course, intend to over- 
simplify the factors that contribute to any personality. The 
phrase "other things equal" seldom holds outside the labora- 
tory where things are kept equal by experimental devices. It 
must be admitted at once that the detailed analysis of any 
personality is an extremely complex matter because of the 
multiplicity of co-acting variables, past and present. In broad 
outline, nevertheless, we can see how such variables exert 
their force by determining when reinforcement and extinc- 
tion will be applied, how often, and by what schedule, and 
similar questions. 

Of great importance to the formation of personality, is the 
fact that human beings can discriminate their own actions, 
appearance, feelings, and successfulness. In the course of 
growing up, the child comes to 'know' about himself; he be- 
comes at least partially 'aware* of his capacities and weak- 
nesses, his likelihood of winning or losing in given situations, 
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his physical and social attractiveness, his characteristic reac- 
tions. This is sometimes spoken of as the development or 
emergence of the "Self," a word that is meant to designate 
the ability to speak of (be "aware" of) one's own behavior, or 
the ability to use one's own behavior as the S D for further 
behavior, verbal or otherwise. The sociologist Mead spoke of 
the "Self" as a social product, that is, it arises out of social 
interaction; but more specifically, we can say today that the 
individual is taught by his fellows to respond discriminatively 
to himself and his behavior. He can observe himself and judge 
himself with words like "good" and "bad." He can estimate 
his own efficacy as a social agent in pleasing people and in 
striving for social success; and if he discriminates what in his 
behavior is causing failure, he may switch to new responses, 
that is, "improve" or "snap out of it." The "Self," in short, 
is the person, his body and behavior and characteristic inter- 
actions with the environment, taken as the discriminative 
objects of his own verbal behavior. They are made discrimina- 
tive for him by his social community, as it teaches him his 
language, because (a) his behavior is itself important to the 
community, and (b) it is also important to the community 
that he 'know' about his behavior. By important, here, we 
mean that the smooth living of both the individual and the 
community depends on his conduct toward others as well as 
his discriminative self-control. 

We can deduce two things from the fact that the "Self* 
arises out of discrimination training and out of verbal be- 
havior. First, the child starts out in life without a "Self," and 
must build one up through stages of dim realization by a 
continuous learning process. "Self-learning probably never 
ends, nor is it ever perfect. There are times in life when new 
requirements are imposed upon the individual and new be- 
havioral possibilities open up to him for exploration. At such 
times, the "Self" may come in for searching examination, and 
may grow rapidly as the individual recognizes new talents and 
capacities; or it may suffer as the individual fails to measure 
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up to what is expected of him by the community. Thus, in 
adolescence, as we usually consider it, we see a transition 
(often attended by social rituals and ceremonies) from child- 
hood to adulthood, with the consequent expectation that die 
person will thereafter exhibit the bearing and behavior of an 
adult, and take on the adult's responsibility for his own be- 
havior. Yet the adolescent does not know what an adult has 
to be and do, and the learning of it is not easy in a society that 
does not planfully allow its children to assume adulthood by 
graduated steps. In our society, we speak of "adolescence" as 
though there could be only one, and that combining both 
physical and social maturation. In truth, however, we prolong 
social adolescence long after sexual adolescence is complete, 
and this creates problems not necessarily encountered in 
societies that recognize the difference. Other societies may 
have many "adolescences," or transitions between recognized 
stages of social responsibility, rather than a single leap from 
childhood to adulthood. But, regardless of the number of 
transitions, at each one there are newly learned increments to 
the "Self," as well as some sloughing off of old parts; and while 
the emergence of the "Self" is expected to come in large steps 
at these official transitions, there is in fact a continual and 
progressive flux in self-discrimination going on at all times. 
When the process is arrested or seriously distorted, society 
may judge the individual's lack of self-control or self-knowl- 
edge as pathological. Psychotherapy is then brought to bear 
upon him in the hope that, with special tutoring, the growth 
of the socialized "Self" may be resumed. 

A second deduction: a person possessing no verbal behavior 
of any sort would not have a "Self," or any 'consciousness/ 
His reactions to the world would be like those of any animal, 
though he might be more "intelligent," that is, he could learn 
more things and faster than could lower animals. He would 
go after positive reinforcements, and would avoid negative 
reinforcements, but would do so directly, without "reflec- 
tion." To ask whether he "feels" this way or that, whether he 



SOCIAL BEHAVIOR 371 

"knows" that he is being pained, whether he "realizes" that 
what he has done is good or bad, and so on, is as idle as to ask 
the same question of a rat. Without verbal behavior which is 
discriminatively conditioned to come out (i.e., to "describe," 
or "report" or "introspect") at the occurrence of stimuli, or at 
the occurrence of some of his own behavior or behavior ten- 
dencies, there is no "conscious awareness" and no "Self." It is 
not that he possesses these things but cannot speak of them; 
it is simply that the question of whether he possesses them is 
meaningless. They are the products of verbal behavior, not 
the causes. In the last analysis, the "Self" and "consciousness" 
are the creations of human society operating on the individual 
by means of verbal training. 

In passing, it is perhaps worth noting that because the 
"Self" is a constellation of discriminative operants, it is under 
some motivational control. We pointed out in Chapter 5 that 
high motivation may distort discrimination to a point where 
a discriminative response may be emitted in the presence of 
remotely similar stimuli or even in the absence of any stimu- 
lus. Some writers speak of this as "autism" or "wishful think- 
ing." We see 'wishful thinking* when a person speaks of (i.e., 
responds to) his "Self" as though he had traits he does not 
really possess. Thus, he may say of himself "beautiful, witty, 
and wise," when he possesses, not these features, but only 
the desire! 

Consistency and Integration of the Personality 

Because personality is the outcome of reinforcement his- 
tory, and because an individual is capable of behaving in 
many ways, there arise the two problems of consistency and 
integration. Behavior is always consistent with the laws of 
behavior, but not always with logic. A man may act in two 
logically inconsistent ways with respect to the same subject 
matter, but if he does so it is because he has been separately 
reinforced for both ways of acting. Thus, in one company, a 
man may decry greed or gossip or intolerance; yet at another 
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time, in another setting, he may practise any one of them. He 
has been reinforced for speaking one way, and for acting 
another. It is as if different response systems, each capable of 
separate reinforcement and extinction, exist within a single 
person's behavioral repertory, with the several systems operat- 
ing in ways that may be consistent or inconsistent as judged 
by some outside logical criterion. Oftimes, when an individ- 
ual has had an inconsistency in his behavior or attitudes 
pointed out to him, he will summon reasons which we suspect 
are not the real ones but are only "rationalizations for his 
inconsistency. Rationalizations are usually unsuspected by the 
one who uses them, but serve as effective smoke screens for 
the real causes of his conduct. 

Overlapping the question of consistency is the broader con- 
cept of integration of personality, or that harmonious func- 
tioning of all the response systems within the individual 
which is necessary for good adjustment to his social and nat- 
ural environment. Persons who are psychologically "normal" 
have a certain oneness about their behavior, a continuity or 
wholeness, that indicates coordination and cooperation (not 
necessarily perfect) among their several response systems. 
They seem to be "in touch with" most of their behavior ten- 
dencies or attitudes, in the sense that they can speak of them 
on demand, or be taught to recognize them. Integration is not 
a clearly defined behavioral concept, but it figures impor- 
tantly in the way clinical psychologists and other students of 
personality think about the organization and functioning of 
an individual's reaction repertory. In one form or another, 
clinical evaluations of personality are likely to include ob- 
servations upon the harmony of interaction of whatever 
"segments" or response systems have been identified in the 
personality. An integrated personality does not exhibit re- 
sponses that are severely out of joint with the circumstances 
in which the individual finds himself, or out of joint with one 
another; nor is the individual entirely unaware of the consis- 
tencies and inconsistencies among his response tendencies. 
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There is a thread of continuity and self-control running 
through integrated behavior that makes one feel that he is 
dealing with a single, articulated, unitary person. On the 
other hand, personalities said to be poorly integrated give the 
opposite impression. In them, it seems as though the response 
systems have no contact with one another; as though there 
were no over-all direction to the individual's behavior that 
gets his actions to cohere, or to correspond with what is called 
for in a given situation. Thus, a patient with hysterical symp- 
toms, such as functional paralysis, is said to suffer from im- 
paired integration; that is, his bodily and behavioral segments 
seem to be going along independently, without relation to the 
personality's unity. Again, pathological cases of multiple per- 
sonality, amnesia, somnambulism, and the like, represent 
failures of integration. It is difficult to convey the intention 
of the concept of "integration" because it is not a rigorously 
definable term. Yet clinicians and psychiatrists find it useful, 
and you would do well to become acquainted with it if you 
expect to read further in the field of abnormal psychology. It 
should be made clear, however, that "inadequate integration 
is a developmental defect" (Shaffer, 1936, page 384). We 
might add, also, that it may show up only when some precip- 
itating circumstance befalls the individual (such as emotional 
shocks, severe negative reinforcement, or threat of punish- 
ment) that he cannot cope with. At such times, personality 
may disintegrate, producing the neurotic and psychotic mal- 
adjustments that are the specialized subject matter of ab- 
normal psychology. 

Social MotivesAre There Any? 

The heading of this section is put as a question, because 
that is probably the best way to approach the matter of social 
motives. The answer must depend on the elementary con- 
siderations about drive that occupied us at the opening of 
Chapter 9. There we saw that the concept of drive has three 
sources: (a) an establishing operation; (&) the effects of that 
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operation upon the momentary strength of reflexes apart from 
further operations of reinforcement; and (c) the possibility of 
reinforcement that the establishing operation creates. It is 
against these criteria that the student must weigh the status 
of any proposed "social" motive. To put the question a bit 
differently, we ask what is intended by the adjectival use of 
the word social as applied to motives. 

On taking thought, the student ought to come to these con- 
clusions: (i) A motive may be called social if its establishing 
operation and appropriate reinforcement involve the with- 
drawal or supplying of social objects or stimuli. Thus, a 
female rat separated from her litter for a time may have a 
response conditioned if return-to-litter is used as the rein- 
forcement. Some have argued that the social aspect of this mo- 
tive is incidental to the underlying factor of pain in the filled 
mammary glands which is alleviated by the draining of milk 
by the unweaned litter, and therefore that the sociality of the 
motive is specious or entirely derived from a non-social drive. 
Counter-arguments have likewise been made against other 
proposed social motives such as gregariousness and sex. With 
further thought, however, you will not fail to see that (2) the 
distinction between drives as social and non-social is not a 
critical one. Classifying drives as of one type or the other is 
not nearly so important as recognizing that many commonly 
supposed social drives are not drives at all. 

When we speak casually of a person "striving for," or 
"desiring," this or that, it is plain that we mean there is some 
motive present. The layman's error, however, comes in his 
assumption that the things striven for identify the motives. 
Take, as examples, the desires for prestige and social approval, 
which the layman might propose to include among his social 
motives. The reflective student will sooner or later ask himself 
what are the establishing operations and drive-reducing rein- 
forcements for the "prestige motive" or "approval motive"; 
and, failing to see an answer, will wisely decide that they are 
doubtful candidates. The many forms that prestige and ap- 
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proval takeawards, smiles, words, deference, invitations, 
money, and so forth are enough to make us wonder in the 
first place how such diverse items ever came to be grouped 
together. It is likely that they are all secondary reinforcements 
because of their correlation with positive primary reinforcers, 
or because they stave off anxiety about possible punishments. 
Stripped of these correlations, the tokens of prestige and ap- 
proval become empty, and ineffective as controllers of be- 
havior. The tokens, furthermore, probably serve as the S D 's 
for the next things to do in our enormously complex chains 
of social activity, since they indicate what behavior, successful 
in the past, is worth doing again. Here, too, when the behavior 
occasioned by past success is no longer reinforced secondarily 
or primarily, the tokens extinguish as do any S D 's that are no 
longer correlated with reinforcement. 

Other candidates for the position of social motive, such as 
the popular one of "mastery," are subject to the same reserva- 
tions mentioned above. In each case, the difficulty of finding 
the establishing and reinforcing operations, as well as the 
infinite variety of forms the "motive" takes, makes us reluc- 
tant to accept them. Concerning dominance, cooperation and 
imitation, enough has already been said to indicate that none 
of these can be subsumed under the drive concept. 

Perhaps a word ought to be said about romantic love as a 
possible social motive since it figures so large in our litera- 
ture, art, and folklore. It is certain that two persons in love 
have more between them than the sex drive alone, since for 
the latter any partner would serve; and an aged couple may be 
in love when sex is no longer a factor in their lives. Yet what 
it is, and even how it is recognized by lovers, cannot easily be 
put down either introspectively or by direct examination of 
specimens. Nevertheless, many people unhesitatingly call love 
a motive, partly because of our literary tradition that tells of 
its force. We may doubt that love is a motive, although the 
behavior involved still interests us. The dramatic case is that 
of "love at first sight," and we can make some guesses about 
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it. Here, each lover at once presents to the other an array of 
discriminative, reinforcing, and drive-arousing stimuli that 
are effective for him. These stimuli have a history, and there 
is undoubtedly a large element of stimulus generalization or 
transference involved. The more of these each lover embodies 
for the other, the deeper the attraction, until the ideal of the 
story-teller is reached in the all-inclusive love. Thus, the bond 
that unites lovers may be tied at their first meeting, but its 
strands were aweaving long before. The matching of life- 
histories in this way is bound to be a rare event in human 
experience, but it need not happen often for the ideal to 
suggest itself. Less dramatically, love may be the outcome of a 
felicitous life spent together. In the course of time, each of a 
pair comes to possess positive stimulus values that may not 
have been present at the start and that give rise to mutual 
devotion and dependence. 

In either event, however, the stimulus roles played for each 
other by lovers are basically the same. Always, of course, the 
continuance of these stimulus functions in love depends on 
the ultimate primary reinforcements provided or shared by 
lovers, including food and drink, play activities, rest and 
sleep, reduction of anxiety, and the relief of the sex drive. But 
whatever true motives become involved in it, love is not itself 
a motive. 

VERBAL BEHAVIOR 

Introduction 

No account of human behavior can be complete that over- 
looks man's verbal activity. It is his highest and most valuable 
form of behavior; more than any other, it distinguishes him 
from lower animals; in it are treasured up man's cultural 
heritage of philosophy and science and art and technology, 
and by it the transmission of this accumulated knowledge is 
effected from generation to generation. Indeed, verbal be- 
havior has made such knowledge possible. The layman may 
take his verbal behavior for granted, but to the scientific mind 
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the forms, functions, and influences of language constitute 
phenomena of the grandest magnitude. 

The analysis of language's forms and functions has been 
approached from many angles in years past. One after an- 
other, the grammatical, linguistic, semantic, and still other 
approaches have sought to clarify the origins and develop- 
ment of language, in the belief that thereby they could clear 
away obstacles to thinking, avoid needless disputes of under- 
standing, make for better communication among men, and 
provide a better basis for dealing with human behavior. All of 
these varied attempts have in common a fundamental assump- 
tion, or set of assumptions, about man and his language which 
the science of behavior finds quite unacceptable. This assump- 
tion holds that a man has "ideas" which he "expresses" in lan- 
guage "in order to communicate them to another person; his 
language is simply a medium wherewith he clothes his ideas so 
as to project them through space to someone else who, hearing 
the words, "interprets" or undresses them back into the under- 
lying "ideas." The assumption is that a man is an agent who 
manipulates words finds the "right words" for purposes of 
communication, and that the words are indicators of ideas 
outside themselves. Such views are essentially mystical, as well 
as logically circular. In explaining verbal behavior, such views 
assert the existence of things which cannot be either proved 
or disproved, and for which the only evidence is the very 
language behavior which the things are invented to explain 
in the first place. The dual classification of man into "mental" 
and "behavioral" components has been a stumbling block for 
theories of both language and behavior. Scientific psychology 
has profitably discarded this dualism. 

You will probably find the problem of verbal behavior the 
most difficult one in this text. The mind-body dualism is 
deeply ingrained in our habits of thinking, and a new view- 
point which does not comport with the old is hard to achieve 
and tempting to resist. Nevertheless, it is in this direction that 
we must go, and the last major topic of this text will be to show^ 
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how verbal behavior can be subsumed within objective psy- 
chological science. Our phrase "verbal behavior" covers all 
aspects of language spoken, written, and gestural but we 
shall limit ourselves to the spoken, since the extension of our 
analysis to the other types would introduce further complex- 
ity in detail but no new principle. 

The Nature of a Behavioral Theory of Language 

Verbal behavior (spoken) is composed of responses of mus- 
cles in the mouth, throat, and chest which we may call the 
vocal apparatus. These responses produce various combina- 
tions of sounds which serve as exteroceptive stimuli for the 
person hearing them and the one emitting them. In addition, 
the responses produce proprioceptive stimuli in the speaker 
that play a part in directing chains of verbal responses. Spoken 
sounds, or the muscular responses causing them, do not 'nat- 
urally' indicate objects or events in the outside or inner world. 
The fact that adult speech bears relation to the environment 
in a more or less lawful manner is something to be scientif- 
ically explained, rather than taken for granted. How such a 
correspondence arises is a central problem for analysis, just 
as it is with an animal whose operant responses become con- 
ditioned and extinguished so that they finally provide a com- 
merce with the world that is based on its actual features and 
requirements. 

A behavioral treatment of language will, then, take as its 
data the sounds emitted by the human organism, just as it 
takes any observable behavior, like the rat's bar-pressing. 
When a person says something, our concern is with the saying 
and with the conditions that control the saying. Enunciation 
is an act, and to relate this act to its controlling factors is to 
understand the 'meaning* of speech. Our treatment will start 
with the young child and show the processes whereby its 
sounds become transformed into language. Our interest will 
not be in the historical origins of verbal behavior in the 
human species, but in the genesis and development of Ian- 
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guag;e in the individual as an outcome of present and past 
variables working upon him. 

That verbal responses were to be regarded as raw data was 
one of the remarkable things proposed over thirty years ago 
by John B. Watson (1919). Watson asserted that when a 
human subject in an experiment spoke of his 'awareness/ or 
'consciousness,' or 'perceptions/ it is a mistake to hypothesize 
the reality of these things. The subject's 'introspective' words 
do not objectively 'report' anything, but should be recorded 
as additional data in themselves, though Watson was not 
clear on how they could be analyzed in any useful way. He 
argued that inner mental events have no independent observ- 
able existence, that we have only the subject's words to deal 
with as our data, and that there is no necessary reason to erect 
a human psychology on the supposed presence of an inner 
controlling psyche. Watson's views were received with hostil- 
ity by most of his contemporaries because they were radical 
for that time, and because they were not developed to a stage 
where their fruitfulness could silence criticism. Moreover, as 
we shall see, they were wrong in one important way. Verbal 
responses do become correlated in the course of training with 
interoceptive and proprioceptive stimuli, so that verbal re- 
ports of inner events acquire some degree of credibility. A 
behavioral account of how these correlations are set up does 
not, however, depend on psychic forces that supposedly em- 
ploy words as a medium of communication. Watson was thus 
correct in his general approach, though he erred in detail. 
After all these years, we can more justly appreciate the sound- 
ness of his thinking, and the acuteness of thought which led 
to his beliefs. 

There are many aspects of verbal behavior about which we 
have no experimental data at present. Even so, the principles 
with which we are already familiar enable us to dissect verbal 
processes in a way that conforms neatly with what is known 
about behavior generally. Perhaps the most valuable thing 
you will get from our discussion is a perspective and an atti- 
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tude toward verbal behavior that will help you avoid some of 
the mystery in this field and help you think about language 
in a scientific way. 

The Emergence of Verbal Behavior in Childhood 

Vocalization alone does not constitute language, and the 
sounds an infant makes are only the raw material from which 
his verbal behavior is formed. At first, his repertory of sounds 
is limited, but as he matures the range of vocal responses in- 
creases. No one language makes use of all the basic sound 
units (phonemes) that the human vocal apparatus can pro- 
duce, but each selects out a number which go into its words 
and ignores others. Thus, the English language makes no use 
of a phoneme like the ch in the German nicht, or of the nasal- 
ized on of French. The infant's repertory of sounds grows 
rapidly from birth and includes more than his own native 
tongue will need, so that, in his language training, part of 
his repertory is retained and strengthened, and other parts 
extinguished. You may recall your own amusement, and your 
teacher's horror, when you tried to pronounce a foreign word 
in high school or college language courses. 

It is, of course, of interest to know how phonemes become 
available in the child's repertory, since this availability is 
related to the rate at which we can expect children to acquire 
proper pronunciations. Data of this kind for English-speak- 
ing children were recorded by Irwin and Chen (1946), in usual 
home environments, and represent the normal phonemic 
growth that occurs without special training in differentiation. 
Irwin and Chen determined the number of native-tongue 
phonemes spontaneously uttered by infants, and plotted the 
growth curve (Figure 77) for the number of types of speech 
sounds appearing at various ages. A total of 95 infants from 
middle-class homes was studied during the first 30 months 
after birth. At a given visit, a child's sample vocalization, as 
uttered on 30 respirations or breaths, were transcribed in the 
International Phonetic Alphabet. The data are plotted for 



SOCIAL BEHAVIOR 38 1 

time units of two months, so that the developmental curves 
for the 30 months are based upon 15 points, each point rep- 
resenting the mean number of types of speech sounds for the 
corresponding two-month period. The curve for all children 
shows that from the first to the last two-month period the 
child passes from 7.5 to 27 of the 35 sounds present in adult 
English speech. 
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FIG. 77. The increase in number of English-language phonemes emitted 
by infants at different ages. (After Irwin and Chen, 1946.) 

In addition to variety of sounds, an infant's vocalizations 
may be analyzed with respect to the frequency of appearance 
of each type. It has been found (Irwin, 1947) that, whereas the 
mastery curves for phoneme types show a decreasing rate with 
age, the frequency of production is a positively accelerated 
function. Thus, as the child grows older, he not only makes 
regular progress in mastering his native tongue's phonemes, 
but his use of these sounds increases at a faster rate. While 
the available data for any given age do not permit analysis of 
speech sounds into discriminative and non-discriminative 
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operants, it is plain that opportunities for elaborate verbal 
conditioning are present in the early months of life. 

The first few years' growth of the child's language is swift, 
with so many facets developing from week to week that a 
close, deliberate analysis of the individual case is rendered 
extremely difficult. Different workers in this field have often 
used different schemes for classifying early language, the 
aforementioned phonemic count method being one possibil- 
ity. Language growth, however, runs through what some 
investigators feel to be different stages of early development 
that are glossed over in a simple count of phonemes. Even 
these stages may be differently listed, but an illustrative line- 
up is that of Eisenson (1938): 

a. The undifferentiated cry occurring at birth and shortly 
thereafter; this crying does not differ noticeably for hunger, 
thirst, noxious stimulations, and the like. 

b. Differentiated crying occurring after the first month 
of life; the cries differ, so that the cause may be discerned by a 
familiar observer like the mother. 

c. Babbling may begin at the end of the second month; 
there is a polyglot phonemic content, weighted on the side of 
vowels. 

d. Lallation repetition by the child of his own vocal pro- 
duction, leading to perseverative vocalization; begins about 
the sixth month or soon after. 

e. Echolalia repetition or imitation by the infant of 
sounds made by other persons; begins at about the ninth or 
tenth month and lasts indefinitely, perhaps throughout life. 

/. Verbal utterance the use of recognizable words in re- 
sponse to stimuli or to control the actions of people; usually 
begins in the first half of the second year. 

The flow of speech development is, of couse, divisible into 
as many stages as we wish, and only practical or theoretical 
considerations can inform us when we are to stop. At the 
present time, any segregation of stages is of limited fruitful- 
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ness, although all attempts of this kind emphasize the large 
changes going on in the early years. That all children go 
through roughly similar changes is itself a provocative ob- 
servation, since it indicates that there are probably uniform 
underlying processes at work. For example, there is the long- 
known fact that congcnitally deaf children do not learn to 
speak. They reach the babbling stage, and probably get to a 
limited lallation, but never progress further. Apparently the 
hearing of speech is a condition for language development. 
Special coaching methods have been worked out, however, for 
teaching deaf children to speak, and there are a few institu- 
tions for this purpose in the United States. These methods 
represent the educational art at a high level, but a scientific 
survey of the principles involved could well lead to a better 
understanding of language behavior as well as possible further 
refinement in the methods themselves. This is almost un- 
explored territory awaiting the attention of trained psy- 
chologists. 

Two Functions of Language: The Mand and the Tact 

Phonemic availability and the listing of developmental 
stages do not, of course, give us information about some 
things. More important for our understanding of verbal be- 
havior are the processes by which it is learned, and the condi- 
tions under which it is used. The forms and functioning of 
language are our central problems. We take the operant 
speech repertory as the raw material on which our principles 
operate, and ask how the specific characteristics of verbal 
behavior arise. We note, first, that the individual's speech 
results from the training given him by those who make up his 
"verbal community/' Without a verbal community to teach 
him, an individual would never develop verbal behavior. In 
asking how the individual acquires speech, therefore, we 
shall find ourselves dealing with other people who are his 
"hearers," and who are the instruments for training his 
vocalizations. 
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Given, then, the phonemic raw material, the principles of 
conditioning and an appropriate teacher, our analysis of lan- 
guage can get under way. We may identify two basic cate- 
gories of verbal behavior which appear early and remain 
fundamental throughout an individual's lifetime. These are 
the mand and the tact. 

i. The mand (from the Latin mando, to order or com- 
mand) is an utterance which procures a specific type of rein- 
forcement from the hearer. A child can reach for his milk or 
he can make a sound milk and the two responses are not differ- 
ent in any functional way. Both are reinforced in the same 
manner except that in the latter case the reinforcement is 
mediated by another organism. The mand as a response is 
strengthened by its consequences just as any operant such as 
bar-pressing, chain-pulling, or running through a maze. In 
the grammarian's terms, the mand is in the imperative mood, 
and it includes demands, entreaty, commands, requests, and 
so forth. Mands are probably the first functional elements to 
appear in the language behavior of the child. They are de- 
veloped under the instigation of primary drives and are the 
first verbal responses to be reinforced by hearers like the 
mother or nurse. When an infant is hungry, for example, his 
general activity, including the vocal, is heightened, and differ- 
ent sounds, interpreted by the mother as approximating 
intelligible words, are fastened upon by her for differential 
reinforcement. She proceeds to condition these sounds by 
supplying milk when the infant says something vaguely re- 
sembling milk, supplying a toy for a sound like tuh, and so on. 
Jespersen (1922, pages 154-155) has put it in these words: 

. . . The baby lies and babbles . . . without associating the slightest 
meaning with his mouth-games, and his grown friends, in their joy 
over the precocious child, assign to these syllables a rational sense. . . . 
It is very natural that the mother who is greeted by her . . . child with 
the sound 'mama' should take it as though the child were calling her 
'mama/ and since she frequently comes to the cradle when she hears 
the sound, the child himself does learn to use these syllables when he 
wants to call her. 
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The conditioning of mands, therefore, is one in which the 
form of the response is specific to the particular reinforcement 
obtained. The 'meaning' of a mand is given by what con- 
sequences follow its emission in a particular verbal commu- 
nity. Mand emission is controlled by the organism's needs 
since these mands have a history of appropriate reinforcement 
under those needs. Mands are not originally discriminative 
operants, but they usually become discriminative because the 
likelihood of reinforcement is greater in some circumstances 
than in others. Thus, manding food or drink usually occurs 
in the presence of certain people and in certain places. Never- 
theless, as with other discriminative operants, if the drive 
becomes strong enough, they may be emitted, even by adults, 
in the absence of the customary S D 's. Sometimes, by extension 
or generalization, manding will occur in situations where the 
local S D 's have never themselves been reinforced. The "mag- 
ical mand" is of this variety, an example being that of the 
bowler who calls after his swerving ball "get in there!" 

We have said that mands are conditioned by the verbal 
community which gives specific reinforcement to specific 
sounds or to words of a specific form. The sounds or words 
selected from the child's repertory for mand training depend 
on the conventions of the verbal community, that is, the 
native language. There is nothing inherent in the sounds that 
necessarily places them in the mand category. Any words can 
serve as mands if they are so conditioned and if they function 
that way upon hearers. Mands are made so by their past his- 
tory of specific reinforcement, and they disclose their charac- 
ter in the behavior of the hearer whose reactions are thereby 
controlled in specific ways. In the case of high drive or stimu- 
lus generalization, manding responses may appear that are 
not actually effective in procuring reinforcement, but the 
behavior is traceable to prior history of the sort we have 
indicated. 

2. The richness and versatility of language in human life 
would be greatly restricted were its content limited only to 
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mands. Of greater importance is the tact function of verbal 
responses (tact from the past participle, "tactus," of the Latin, 
"tango/' to touch). Whereas the mand is only incidentally 
under the control of S D 's, and is a response whose form deter- 
mines a specific reinforcement, the tact is of a different com- 
plexion. The tact relation is one in which the form of the 
verbal response which the community reinforces is related to 
particular discriminative stimuli, and the response is given, 
not specific reinforcement, but generalized reinforcement. 
We may think of tacting as a 'naming' function: if the speaker 
emits the required sound when a given S D is present, he is 
reinforced. The tact is not motivated by a special need of the 
speaker, and it does not call for a special reinforcement. The 
correlation between the response and its proper S D must, how- 
ever, be reinforced somehow, and what we observe is that the 
reinforcement provided by hearers is a general one. This 
reinforcement may be in the nature of smiles, approval, or 
money, all of which have in the past been associated with 
many types of primary reinforcement and are now, on that 
account, secondary positive reinforcers operating effectively 
in a wide range of situations. It has been said that the notion 
of generalized reinforcement is probably the most important 
single characteristic of verbal behavior, since it gives the 
verbal community almost unlimited power to train the indi- 
vidual. The vital result of this training is that the verbal 
responses of the individual are made to stand in a dependable 
relation to the environment. In the grammarian's terms, the 
tact relation is that of the declarative sentence; it is an an- 
nouncement of "fact" representing relatively disinterested 
behavior on the part of the speaker, behavior for which he 
gets nothing in particular but only something in general. 

Of all the verbal behavior acquired by the individual, the 
tact relation is of the deepest value both to himself and to the 
community. By teaching him to correlate words with facts 
and states of affairs, both inside and outside himself, the com' 
munity opens up for him the opportunity to participate in, 
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and contribute to, human discourse and human wisdom. If 
human language were limited to mands alone, it would differ 
little (except for size and clarity of the vocabulary) from the 
grunts and barks of lower animals. With mands alone, we 
would never rise above the level of making our personal needs 
and states discriminable to other persons who might then 
(were they reinforced for doing so) act to alleviate our hunger 
or thirst. It is the tact relation, however, that makes man able 
to speak of, to 'know/ and be able to 'think' about, the world 
and himself. 

The Speaker-Hearer Relation 

In any verbal community, the role of speaker changes hands 
continually, as does that of hearer, so that a given person is 
now one and now the other. An analysis of verbal behavior 
must cover this double-ended process if we are to glimpse the 
full scope of verbal functions. A key question we can take off 
from is, what reinforcement does the hearer afford the speaker 
for speaking, and what is afforded the hearer for listening and 
reinforcing the speaker? Who profits, and how? Clearly, both 
speaker and hearer must profit if they are to act as either, but 
we must see how this is effected. 

INDfVIDUAL A: Rj(S> RjJ(Sj' D ) 



V /QD\ D v fQf*D\ D v /tc r 'D\ 

i^i* n.lb, J K.tbg ' 

\y v/ \ 



INDIVIDUAL B: RjCS^) Rj(S r 4 u ) etc. 

SPEAKER-HEARER RELATION 

R* = "What time is it?" R* = "Don't mention it." 

R v _. Twe i ve oc iock." R* = "How about lunch?" 

Rg = "Thank you." 

The speaker's speaking is reinforced by the hearer in two 
ways. His mands are specifically reinforced according to his 
needs, while his tacting secures generalized reinforcement 
from the hearer who thus "encourages" him to go on. The 
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hearer has his own reasons for listening and reinforcing the 
speaker. If he reinforces the speaker's mands, he gets rein- 
forced himself in one or both of two ways: generalized rein- 
forcement from third parties for doing the ''proper thing," 
and somewhat more specific, though delayed, reinforcement 
from avoiding trouble caused by not meeting the speaker's 
needs. A mother, for example, responds to her child's mands 
as she is "expected to" by the community which requires her 
to be a dutiful mother; but she may also do so because she 
thereby prevents such later troubles as coping with an under- 
nourished or injured child. 

An interesting problem, however, is why a hearer reinforces 
a speaker's lading. Hearers reinforce tacting because they 
make use of the information they obtain from the tacting. 
Thus, a speaker may say "there is food in the refrigerator," 
or "fire!" with resulting reinforcement to the hearer if he 
bases his own behavior on these tacts. The hearer gets an 
ultimate reinforcement for giving a speaker immediate re- 
inforcement, since good tacting provides guides or S D 's for the 
hearer's own successful responding. Each hearer obtains from 
many speakers in his community much more knowledge than 
he could obtain by himself; each person in a community 
benefits all others by acting as additional eyes and ears for all, 
and this is as true for opinions and meditations as for straight 
reporting of environmental facts. Because of this mutual 
profit, the community plenteously reinforces tacting in the 
verbal behavior of its members. Note that we are considering 
the "informative or communicational value" of speech, with- 
out resort to the dualistic notion that speech is merely a 
medium for expressing 'ideas' in the 'mind/ Tacting responses 
are acquired according to conditioning principles, and the 
behavior of hearers may be explained in the same way. 

We have pointed out earlier that verbal behavior would 
never develop without a verbal community which mediates 
the reinforcement for it. The fact that reinforcement is at the 
disposal of the hearer means that the hearer is the one who 
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acts as the teacher of speech. Any speech response is a complex 
pattern of muscular action in the vocal apparatus, and as we 
all know it takes a long time to acquire acceptable pronuncia- 
tions and inflections, in addition to grammatical forms and 
sentence structures. Involved here is the matter of response 
differentiation, for which the responsibility lies in the hands 
of the hearer-teacher. As the child matures, hearers use the 
devices of selective reinforcement for, and successive ap- 
proximations to, the desired sounds. By this social insistence, 
the child's speech is gradually molded into accepted forms. 
"It is not yeah, son, but yes can't you say it properly?", we 
tell our youngsters. Or we may grow angry and threaten to 
withhold some desired thing unless the child abandons his 
baby talk and speaks as "a big boy should/' Furthermore, 
children are soon conditioned to imitate the sounds of adults 
so that the active role of the hearer as a differentiator may be 
reduced to the point where he is unaware that he has done 
anything. You have undoubtedly observed, as did Mark 
Twain, that French children learn French and Chinese chil- 
dren Chinese, without any trouble! 

An interesting and extremely important variety of tacting 
is that which has internal stimuli as the discriminative source. 
From childhood on, the individual is taught by the commu- 
nity to report to hearers on his interoceptive and propriocep- 
tive stimuli. The adult can, for example, tact the location 
of pain, whereas the child may be in obvious distress and 
yet unable to say even what part of his body hurts. Many 
expressions beginning with "I feel ..." are tacts of internal 
stimuli and states like pain, sleepiness, fatigue, tension, 
hunger. Such tacts are an essential source of information 
about goings-on inside the individual which are otherwise 
inaccessible to an observer. To the community, this informa- 
tion is necessary for meeting the individual's needs, assuring 
his survival, and predicting what he is going to do. The indi- 
vidual learns such tacting because it is heavily reinforced: if 
he says his tooth aches, it can be relieved; if he says he is 
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melancholy, he can be cheered; if he speaks his fear, he can be 
calmed. 'Introspective" reports of feelings, thoughts, and the 
like, are what have led to the dualistic conception of an in- 
ternal mind as separate from the language expressing it. No 
one doubts that hidden internal S D 's for such tacting exist, 
and that as a result of training the tacts do serve to give 
information about them. But a scientific theory of verbal be- 
havior takes the tact relation as a datum, and refers its origin 
to known principles, rather than to some mind-agency within 
the individual that somehow knows which word should go 
with which stimulus, and thereupon sorts them into meaning- 
ful speech. The dualistic view takes the existence of referents 
for words, the correspondence of speech with fact, and the 
conveying of information to hearers, as matters that need only 
to be logically or grammatically formalized. The science of 
behavior asks how these things come about. 

You may ask at this point what makes us decide that some 
introspective tacts are valid and others not. Why, for example, 
do we question "mind" if a person says, "My mind is work- 
ing"? To this and many similar questions, we can reply that 
there is no doubt the speaker is saying something, but he is 
not necessarily tacting what he supposes he is. Thus, a person 
who says "I am thinking" does not have ghostly thoughts as 
his S D , but rather muscular responses of the vocal apparatus 
(and perhaps other parts of the body) which he tacts as "think- 
ing." A child wrestling with a problem is told by his parent 
"My, but you're thinking hard, aren't you?" and he gropingly 
replies "am I thinking?"; thereafter, similar activity will also 
be called thinking although he learns in later life to make 
vague statements about the "nature of thought/' We all think, 
but we do so with our muscles which provide the only S D 's for 
the tact "thinking." If we are asked, however, to describe 
thinking we do so incorrectly because we have learned to talk 
about thinking in a certain way. We may say that we "feel our 
brains working," but this is not a valid tact since neural 
activity in the brain is not itself perceptible and no discrim- 
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inative response can be anchored to it. We could examine in 
this manner all the misconceptions about themselves that men 
evince in their verbal responding. For the moment, it will 
suffice if the basic viewpoint is grasped to some extent, for 
this is perhaps the most difficult reorientation of all for begin- 
ning students in psychology. 

Distortions of the Tact Relation 

Ideally, the tact relation would contain a response made 
discriminatively to a single S D . In actuality, this relation is 
subject to many deflections, and some of the more important 
kinds may be noted. In the following cases, you might provide 
your own running commentary on the social acceptability of 
the distortions, and of the way they are learned, extinguished, 
or punished. 

i. Generalizations of the S D . Since the tact is a discrimina- 
tive operant, generalization of the S D is naturally to be ex- 
pected. Such generalizations may have practical use. They get 
information to a hearer more briefly than otherwise, or per- 
haps more comprehensively than by direct statement, because 
the hearer's own background of experience is called into play. 

a. Metaphorical extension: this is used not only by poets 
but in ordinary conversation. "Thin as a reed," or "strong as 
a horse/' or "fast as a deer," are commonplace similes. Con- 
tractions may occur if the speaker is sure his hearer will 
understand, as with "she is a flower." 

b. Generic extension: here the generalization is sanctioned 
by the usage of the community. We speak of "players" in 
different sports and games, yet some are professionals who are 
clearly not 'at play/ 

c. Redintegration: a form of generalization in which a 
part of a stimulus complex is used to stand for the whole (see 
page 152 in Chapter 5). "Counting noses," "reciting Shake- 
speare," are such expressions in which noses stand for people, 
and one recites only what Shakespeare wrote. 

2. False and non-existent S D 's. These are instances where 
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a speaker pretends to tact although there is no S D for it, or 
where the speaker tacts incorrectly. 

a. Lying, or contrary-to-fact: this may be done to avert the 
punishment that a truthful tact would incur. ''No, I did not 
break that, Mother/' A false tact may also be used to obtain 
specific positive reinforcement. "I got an A in arithmetic to- 
day/' says John, if that is the only method he has to get his 
candy. Of all people, we depend on our scientists to be truth- 
ful tacters without specific rewards or inducements. Imagine 
the catastrophe if they had to fabricate data in order to eat. 
A scientist must be free of pressures that turn tacts into con- 
cealed mands, and society sets before him standards of honesty 
that are more heavily reinforced than standards of success. 

b. Exaggeration or invention: this is a way of obtaining 
generalized reinforcement from an audience that would 
otherwise withhold it. We have all listened to tall stories, em- 
bellished autobiographies, and dubious tales of personal 
conquest. 

c. Distortions that produce special effects on the hearer: 
telling jokes and 'tear-jerkers/ 'crying wolf to make a person 
do something we want, and telling stories with a moral are 
instances where the hearer's behavior is to be manipulated in 
special ways. 

A listing like the above does not avoid some overlapping 
between types of tacting. Its purpose is to point up different 
ways that tacting appears in the stream of speech activities, 
so that you may get a feeling for the processes involved. 

Other Controlling Factors in Speech 

Whereas mands are of value most directly to the speaker 
whose needs they serve, the tact relation is of greater value to 
the verbal community. Tacting is the quintessence of social 
behavior, and of all verbal behavior it is the part we are most 
concerned to understand. We may, therefore, take a bit more 
time on some factors that control responses of the tact class. 
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Some of these factors may be obvious, but even they are worth 
making explicit. 

1. Textual factors. Verbal responses are made to certain 
stimuli by way of reading, that is, we tact the marks on paper 
that make up writing. Here, the form of the response has a 
particular conventionalized relation to the written symbols. 
Schools devote much of their curriculum to the teaching of 
reading. Moreover, some written matter gives directions foi 
continued verbal responding by the reader for example, 

mathematical symbols like -f-, ~- , + 1 and we then get 

into the problem of how such follow-up responses are taught 
to the individual. 

2. Echoic responses. We mentioned earlier that there 
seems to be a stage when the child begins to imitate or echo 
the sounds made by others. But even before he learns to speak 
words, his babbling acquires the inflectional pattern of his na- 
tive tongue, so that he babbles English, as it were, even before 
he can speak it. These observations indicate the importance 
of the heard sound. Later on in life, it is easy to evoke echoic 
behavior by simply instructing a person, "repeat after me." 
Moreover, it is likely that all of us tend to repeat speech we 
are listening to even without being told to do so. We murmur 
to ourselves as the speaker talks; we use his words to introduce 
our own sentences; we go along with him and complete his 
sentences for him. In an exploratory study of this problem, 
Ritter (1949), using adults as subjects, had them memorize 
nonsense syllables. The instructor then called out one of 
these syllables, with the subjects instructed to respond as 
quickly as possible with the first syllable they thought of. 
They could respond with any of the memorized syllables in- 
cluding the one called out. The data showed a notable tend- 
ency for them to repeat the heard one despite the fact that 
they had a number of other well-learned choices available. 
Echoic control over verbal responding seems real enough, but 
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we need to know much more about it before we understand 
it fully or can estimate its magnitude. 

3. Audience. The speaker's audience provides an imme- 
diate control over his verbal responding. A given part of the 
verbal repertory may be activated by one audience and left 
untouched by another. We would not address a minister as 
we do our fraternity brothers the tone and content alike of 
our speaking would differ. Recall that a hurt child will cry 
more when there are people about than when alone, and still 
more when, of all people, its mother is there. Where audience 
control of speaking fails conspicuously, we have a symptom of 
psychopathological behavior in which verbal output is, we 
say, not 'in contact* with the immediate social situation. 

4. Motivation. Variations in drive strength affect the 
strength of verbal responses just as they do any other kind of 
behavior. This is quite clear with respect to mands, but tacts 
are also susceptible. Slips of the tongue, unwitting puns, and 
so forth, in a speaker's output often indicate some motive at 
work in him that has taken priority over the usual S D 's. Freud 
has made extensive use of this observation, but it would be 
unwise to conclude that every slip or pun has a motivational 
basis. There is multiple causality in speech, and sometimes 
errors and distortions occur because the tongue is sidetracked 
by similar phonetic words or by the intrusion of related 
themes from the speaker's history. An invention (Skinner, 
1936) called the "verbal summator" brings out many of the 
factors mentioned here. Combinations of vowels are played 
from recordings at a low loudness, each combination being 
repeated a number of times. The subject is told that "this is 
a test in the clarity of speech'* and is asked to listen carefully 
and to put down, as soon as he thinks he has detected it, "what 
the man is saying." Of course, the man is not actually saying 
anything but a vowel series like "uh-ee'-uh-uh-uh," but this 
acts to select out of the subject's repertory words or sentences 
that are, for him, the stimulus. He reports "hearing" the man 
say such and such, but what he really hears is himself saying 



SOCIAL BEHAVIOR 395 

such and such in response to the meaningless sounds of the 
recording. It is as if the repetition of incoming stimuli builds 
up the strength of the subject's latent speech, and soon those 
items that are closer to the emission threshold "come to the 
surface" and are spoken out. The verbal summator is related 
to testing methods which have been called "projective tech- 
niques" in the study of personality. Different subjects re- 
spond in different ways to the same meaningless stimuli, and 
their responses are often revealing of their motives and their 
history. Tabulations of response frequencies to the summator 
often show that the most common responses refer to the op' 
posite sex, to worries, the wish to go home, and so on. When 
normal persons are compared with mental hospital patients 
(Shakow and Rosenzweig, 1940) gross differences appear in 
their responses to the summator. This is not wholly unex- 
pected, since we already know something of how these con- 
trasted groups perform on many other tests. The investigation 
indicated, however, that the summator technique deserves to 
be explored more thoroughly in the analysis of normal speech 
and psychopathic deteriorations. 

Further Characteristics of Verbal Behavior 

Broadly considered, verbal behavior is subject to greater 
delays in reinforcement than are some other ways of respond- 
ing. Mands, for example, require action by hearers who 
mediate the reinforcements, and often the speaker could get 
his reinforcement more quickly by reaching or moving him- 
self. Children must learn to ask for bread rather than grab 
for it. Moreover, in both manding and tacting, we may fre- 
quently be disappointed by our hearers who fail to supply 
any reinforcement whatever. Even our polite requests may 
often be rebuffed. The result is that we often speak with less 
assurance than we act. 

Vocal language has special advantages for the range and 
utility of our verbal behavior. There is no necessary reason 
why language could not originally have been gestural or 
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tactile. As we may conceive it, however, the speech type was 
selected out owing to its greater versatility and value in pro- 
curing reinforcement (perhaps by leaving the hands free). 
Constructed as we are, olfactory, heat, or radiational lan- 
guage would not be an effective basis for a verbal community. 
Sight or hearing would be more likely bases for interstimula- 
tion at a distance. Of these two, however, a language of seen 
movements is limited by the necessity of looking and by its 
ineffectiveness in the dark. Moreover, a vocabulary of move- 
ments-to-be-seen would be restricted to the number of dis- 
criminably different movements made by the 'talker/ and this 
number is not large. Finally, such a vocabulary is not easy to 
execute but requires considerable expenditure of energy that 
is negatively reinforcing to some degree. On the other hand, 
speech gets around most, if not all, these difficulties and is, 
in addition, capable of an indefinitely large vocabulary. Re- 
sponses of the vocal apparatus can be differentiated very finely 
to produce an enormous number of sound combinations that 
are discriminably different to the hearer. Speech movements 
are small and cost little energy, they can be rapidly made, 
their intensity can be varied as well as their pitch, they are 
easily combined into chains, and so on. We may think of lan- 
guage functions as having located themselves in the vocal 
apparatus by virtue of a kind of natural selection that we can 
describe in terms of reinforcement principles. 

The first years of language development in children are 
commonly characterized by the fact that their verbal behavior 
is audible, that is, children do all their talking aloud. It needs 
schooling and social pressure to quiet down their speech so 
they do not say aloud everything that 'comes into their heads/ 
Even as seemingly simple a thing as silent reading takes up a 
good deal of teaching time in the early grades. And, even as 
adults, we may talk aloud to ourselves when we are alone; 
and, by the same token, mental patients showing 'withdrawal 
from society' (i.e., a loss of control over behavior by social S D 's) 
talk to themselves aloud without regard for the presence of 
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others. Silent speech is an important part of thinking. When 
engaged on difficult tasks like mathematical problems, we are 
likely to abandon silent speech and revert to 'thinking aloud.' 
Perhaps the suppression of talk is not altogether good for our 
intellectual efficiency. It does, of course, conceal from our 
neighbors things we do not want them to know. Yet, from 
the standpoint of mental hygiene, it may be better to talk 
out our feelings than to let them rankle. 

In general, speech uses only the smaller muscles of the 
body, and in silent speech or thinking still smaller movements 
of these small muscles are involved. But man's speech is a 
mighty lever, and one may truly say that never did such small 
muscles move such large worlds. 

Consciousness and Understanding 

Verbal behavior can itself act as the S D for further verbal 
behavior. That is, we can talk about our talking, or think 
about our thinking. We may use the term secondary language 
to designate verbal behavior that has verbal behavior as its 
S D . As we have said, "consciousness" is probably reducible 
in the end to the ability of a person to verbalize adequately 
his own actions, including his prior verbal responses. We say 
he is 'oriented' or 'self-aware* when he can tact his own be- 
havior in about the same way that we would. 

We are satisfied, also, that a person comprehends or under- 
stands us, not if he echoes what we have just said, but more by 
his ability to say it in a different way. Merely to repeat our 
words verbatim tells us only that our own verbal chains were 
memorized. If he can start with our words and go on to 
further statements that we ourselves might make, we conclude 
with more confidence that he has understood us. Understand- 
ing is a verbal activity, for the person who understands us 
talks as we do. Like all behavior, understanding needs rein- 
forcement if it is to be made firm. This being so, one may 
examine various forms of instruction used in schools, for 
example, to see which provides the best reinforcement for 
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comprehension, that is, proper speaking or thinking. From 
such an examination, the lecture method of teaching, so 
common at the college level, emerges with a poor recom- 
mendation. The lecture method provides no reinforcement 
for the hearer's speech except the long-delayed one of exam- 
ination and final grade. It is not to be wondered that, with the 
small comprehension achieved by lectures, students often 
aim on examinations only to reproduce verbatim the lec- 
turer's words. The notes that he takes are a means of stimulat- 
ing talk in himself after he is alone (a kind of lecturer-student 
conversation without the lecturer present), but can only par- 
tially succeed in filling the gap left by the absence of imme- 
diate reinforcement for the hearer's talking. On the other 
hand, the laboratory method of instruction, and the discus- 
sion-group method, hold out the possibility of superior learn- 
ing if they are well used. One danger in discussion groups 
that are not skillfully conducted is the tendency to reinforce 
every discussant for saying almost anything as his "contribu- 
tion," so that no one carries away any more comprehension 
than he brought with him. But the lecture method inherently 
promises little for any but selected audiences already so well 
trained in a special subject matter that the heard words fit 
into, and generalize with, a verbal repertory prepared in 
advance and primed at high strength while listening. 

Problems of individual understanding are joined with 
those of common understanding in the community, that is, 
with verbal usages that are expected to have a common mean- 
ing for all persons. The "meaning" for a speaker of a word or 
sentence is, of course, defined by the sum total of conditions 
under which it is emitted; while the "meaning" for a hearer 
is defined by the behavioral consequences it induces in him. 
As background and training differ or agree for different 
speakers, the conditions evoking a given word from them will 
overlap but perhaps not coincide exactly. Similarly with the 
partial coincidence of reaction on the part of hearers. The 
study of semantics centers on questions of meaning like this, 
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and it seems probable that the behavioral and semantic ap- 
proaches to language will draw closer together in the future. 
It is interesting, finally, to conjecture what the impact may 
be upon philosophy when the behavioral analysis of language 
reaches a more formidable stage than at present. Language is, 
in a way, both the medium and substance of philosophy. It 
would be strange indeed if an inquiry into the nature and 
origins of verbal behavior had no bearing upon the import or 
validity of that same verbal behavior. For, even if it is true 
that an idea may be valid without our knowing its source, it 
is also true that in many cases we cannot judge its validity 
except from its source. In like manner, it would be strange if 
our knowledge of the medium in which philosophers speak, 
take issue, or agree with one another, did not help us also 
know when their talking is sound and fury, and when it 
signifies something. No one can say how far in the future lies 
the first strong impact of behavioral science upon philosophy, 
but that it will come seems certain. In a science that takes the 
whole of behavior as its province, what part of man's activi- 
ties shall be said to lie out of bounds and be exempt from 
scrutiny? Who can justly deny her the right of passage 
through any meadow, and on what basis declare that she 
trespasses? 

NOTES 

Social psychology is such a large and inclusive province that there is 
often little similarity in the content of textbooks within the field. An 
interesting approach is that provided by Klineberg's Social psychology 
(1940), the material of which may be used to supplement some of the 
topics we have treated in this chapter. A handbook of social psychology, 
edited by Murchison (1935), is also to be recommended, especially for 
its description, by various authorities, of sub-human social behavior. 

An enlightening experiment on cooperative behavior in chimpanzees 
has been carried out by Crawford (1937). This investigator's exposition 
lends itself readily to the type of analysis you have been using, and 
supplies a splendid approach to the complexity of human cooperation. 
Maslow's (19363, 19360) observations on the dominance-submission 
relation among baboons serves a similar purpose. With respect to imi- 
tation, we have already mentioned (page 361) the studies reported by 
Miller and Dollard (1941). 



400 PRINCIPLES OF PSYCHOLOGY 

In our treatment of verbal behavior, we have leaned primarily upon 
the analysis recently presented by Skinner (ig48b) in a series of formal 
lectures at Harvard University. Skinner's approach amounts to the 
first important extension of reinforcement theory into the realm of 
human language function, and is an impressive attempt to free the 
problem of language from mind-body dualism. 



A LAST WORD 

The power of science is nowhere in history more stirringly 
revealed than in its first victories over the most difficult and 
defiant subject matter ever presented to it the behavior of 
organisms. This book has attempted to plot, in general fashion, 
some of the paths that have been opened up in man's study 
of his own behavior and that of his fellow-creatures. Starting 
with the humble analysis of simple bits of behavior, modern 
psychology has extracted a few shining conclusions that dispel 
somewhat the mists which once shrouded human and animal 
behavior and made an understanding of its laws a seemingly 
impossible goal. 

All behavior, as we can now discern it, is composed of varia- 
tions on a few basic themes. For the first time in mankind's 
saga, these themes are open to all who wish to see them in the 
steady light of science, rather than by the rare illuminations 
of intuitive minds. We are on the frontier of an enormous 
power: the power to manipulate our own behavior scientif- 
ically, deliberately, rationally. How this power will be used 
whether for good or ill no one of us can tell. Certain it is that 
whatever use is made of it will be determined by the character 
of the persons using it. But character itself is open to a science 
of behavior. We need to hasten and train a generation of men 
of good will. How this is to be done may be mankind's last 
desperate question of all. Without a science of psychology, no 
answer is possible; but psychology, while offering the 
methods, cannot ensure their use. It is to the latter that we 
finally commend our readers. 
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32*> 323 

vacillation, 320, 322, 323 ff. 
Consciousness, 370 ff., 379, 397 
Consistency of personality, 371 ff. 
Constancies, perceptual, 151 ff. 
Context theory of meaning, 2igff. 
Cooperation, 357 ff., 399 

defined, 357 

discrimination in, 358 ff. 

reinforcement in, 358 ff. 
Covert response, 212 ff., 217 ff. 
Cues to depth perception, 
Cultural environment, 363 ff. 
Cultural relativity, 3i4ff., 368 
Cumulative curve, 42 ff. 
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Cyclic deviations 

of rate in extinction, 71, 90 ff. 
of latencies in avoidance, 314 



Deaf, learning of speech in, 194(1". 
Deaf mutes, thinking and dreaming 

in, 214 if. 
Definitions 

anxiety, 309 

aversion, 303 

cooperation, 35; 

latency, 9 

limen, 8 

negative reinforcement, 61 

operant behavior, 49 ff. 

operant level, 76 

psychology, 2 ff. 

reflex, 4 

reflex strength, 10 ff. 

reinforcement, 61 

respondent conditioning, 30 

response, 3 

secondary reinforcement, 232 ff. 

stimulus, 3 

threshold, 8 

Type R conditioning, 47 ff. 

Type S conditioning, 47 ff. 
Delayed reflex, 21 
Delayed reinforcement, 208 ff. 
Depressing effect on rate, see Anxiety, 
Cyclic deviations, Negative rein- 
forcement, Punishment 
Depth perception, 150 ff. 
Differentiation, response, (Ch. 6) 
186 ff. 

and discrimination, 192 ff. 

of force, 186 ff. 

of rate, 187 

of topography, 190 

of verbal behavior, 384, 385, 389 
Dilemma, 319 ff. 

Discrimination, stimulus (Ch. 5), 
ii6ff. 

abolishment of a, 122 ff. 

changing stimulus, 152 

in code learning, 149 

compound stimuli, 149 ff. 

concept formation, 155 ff. 

and differentiation, 192 ff. 

drive, effects of, 290 ff. 

extinction after, 123 



in humans, 121 

latency as a measure, 145 ff. 

multiple, 147 ff. 

operant, 117 

previous training, effect of, 120 ff. 

principle of, 117 

of rate, 97 ff. 

respondent, 117 

temporal, 21 ff., 86 
Discriminative capacity, 130 ff. 
Discriminative stimuli 

in avoidance, 311 ff. 

in bar-pressing chain, 198 ff. 

in chaining, 2ioff. 

in cooperation, 358 ff. 

in copulation, 302 

distinction from secondary rein- 
forcers, 238 ff. 

distortions of tact relation to, 391 ff. 

drives, accompanying, 276 ff., 287 ff. 

equality with secondary reinforcers, 
237 ff. 

and feelings, 256 

generality of, 242 ff. 

in imitation, 361, 363 

internal, 97 ff., 211 ff., 324, 349 ff., 
389 ff. 

proprioceptive, 97 ff., 21 iff., 324, 
349 ff., 389 ff. 

rate furnishing, 97 ff. 

response produced, 324 

as secondary reinforcers, 236 ff., 245 

self as, 369 ff. 

sex symbols, 303 

in submission, 356 

tact, 385 ff., 391 ff. 

transfer to new response, 242 ff. 

in verbal behavior, 385, 386 ff. 

in word association, 223 
Discussion method, 398 
Dominance, 356 ff. 
Double alternation, 203, 206 
Doubles, reinforcement of, 229 

See also, Units of response 
Drive (Ch. 9), 262 ff. 

activity drive, 278 ff. 

appetites as a class, 274 ff. 

aversions as a class, 274 ff/ 

during conditioning, ii2ff. 

and emotion, 329 ff. 

in extinction, 266 ff. 

hunger drive, 284 ff. 
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measurement of, 873 ff., 279 ff ., 
282 ff., 288 ff. 

misconceptions of, 276 ff. 

operations, 264 

in periodic reconditioning, 267 ff. 

physiological concept, not a, 277 ff. 

pleasure-directed, not, 278 

response, not a, 277 

in secondary reinforcement, 241 

sex drive, 294 ff. 

state, as a, 269 ff. 

stimuli accompanying, 276 ff., 286 ff. 

stimulus, not a, 276 ff. 

stimulus theory of, 286 ff. 

sub-hungers, 272 

and variability, 179 ff. 

in verbal behavior, 394 ff. 

See also Anxiety, Aversion, Avoid- 
ance, Social motives 
Dualism, mcntal-behavorial, 377, 



Eating behavior, 42 ff., 263 ff., 272 ff., 

291 ff. 

Echoic responses, 393 
Echolalia, 382 
Effect, law of, 39 

criticisms of, 41 

operant conditioning and, 51 ff. 
Effort, see Force 
Effort, law of least, 177 ff., 196 
Eliciting stimulus, 17, 19, 30, 47 ff., 

see Respondent conditioning 
Emotion (Ch. 10), in, 326 ff. 

acclimatization, 327 ff. 

anger, 342 ff. 

chain, breaking of, 342 ff. 

conditioning of, 346 ff. 

and drive, 329 ff. 

expression of, 335 ff., 344, 346 

facial expression in, 335 ff. 

fear, 340 ff. 

feelings, 350 

identification of, 333 ff., 337 ff. 

in infant, 335 ff., 338 

joy, 344 ff. 

negative reinforcement, 328 ff. 

observing our own, 348 ff. 

operations, 341, 342 ff., 345 

pleasantness, 350 



readings, 351 

signs of, in rat, 328 ff. 

sorrow, 344 ff. 

as a state, 331 ff. 

and stimulus intensity, 330 ff, 

theories of, 332 ff. 

verbal report of, 349 ff. 

See also Anxiety, Aversion 
Equilibrium, point of, 319, 320 
Errors, as a measure, 57 
Escape from reality, 319 
Escape training, 66, see also Aversion 
Exaggeration, 392 
Excitement, 346 
Exercise, law of, 40 
Expression of emotion, 335 ff., 337 ff. 
Extinction (Ch. 4), 67 ff. 

of avoidance responding, 314 

of a chain, 201 ff. 

curves, 71, 80, 90, 98, 107 

deviations in rate during, 71, 90 ff. 

in discrimination, 1 19 ff ., 123 

drive in, 266 ff. 

emotion in, 71, 328 

of emotion, 347 ff. 

envelope, 71 

after fixed-ratio reinforcement, 97 

and forgetting, 78 ff. 

of members of a chain, 201 ff. 

negative reinforcement and, 106 ff. 

one-trial, 82 ff. 

of operants, 70 ff. 

after periodic reconditioning, 89 

punishment and, 106 ff. 

respondent, 68 ff. 

in response differentiation, 186 

of secondary reinforcers, 234 

spontaneous recovery after, 70 

and variability, 181 ff. 

See also Resistance to extinction 
Extinction ratio, 88 

and fixed ratio, 91 ff. 

and resistance to extinction, 89 



Facial expression in emotion, 335 ff., 

337 . 

Family as social environment, 367 ff. 
Fatigue, reflex, 12 
Fear, 340 ff. 

effects, 341 

operations, 341 ff. 
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theories, 332 ff. 

See also Anxiety 
Fechner, 130 ff. 
Feelings, 254 ff., 350 
Fetishism, 305 
Fixation, see stereotypy 
Fixed interval, reinforcement at a, 

83 ff., 267 ff., 289 
Fixed ratio, reinforcement at a, 91 ff., 

97. l8 7 

Following a leader, 361 ff. 
Force of bar- pressing 

differentiation of, 186 ff. 

and resistance to extinction, 196 

variations in, 170 ff. 
Forgetting, 78 ff. 
Frequency as a measure, 50 ff. 
Freud, 104, 303, 325, 367, 394 
Frustration tolerance, 91 



Generalization, response, see Induc- 
tion 

Generalization, stimulus (Ch. 5), 
115 ff. 

in clinical treatment, 126 

in code learning, 148 

in concept formation, 155 ff. 

gradient of, 126 ff. 

and love, 376 

mediated, 160 ff. 

of a phobia, 125 

principle of, 116 

reaction time as measure, 143 
Gradient 

of approach, 317 ff. 

of avoidance, 317 ff. 

of generalization, i26ff. 

of reinforcement, 208 
Grief, 344 ff. 

Guilt-detection by word association, 
225 ff. 



Heterogeneous chains, 205 ff. 
History of reinforcement in individ- 
ual, 366 ff. 

Holding behavior, 304 ff., 313, 323 ff. 
Homogeneous chains, 205 ff. 
Homosexuality, 302 
Hunger drive, 284 ff. 
in humans, 289 ff. 



measurement of, 288 ff. 
stimulus theory of, 286 ff. 
and stomach contractions, 285 ff. 
in verbal behavior, 290 



Ideas, 212 ff., 377 ff. 
Imagination, 213 ff. 
Imitation, 193 ff., 361 ff. 
Incipient responses, 323 ff., 350 
Induction, response (Ch. 6), 168 ff. 

See also Variability 
Inefficient responses, maintenance of, 

178 ff. 

Infant, see Child 

Inferred variables, see Intervening 
responses, Propioceptive stimuli, 
State 

Inner speech, 2i3ff. 
Insects, social behavior in, 353 ff. 
Insight, 41, 59 ff. 

criticized, 65 ff. 

factors leading to, 60 
Integration of personality, 371 ff. 
Intensity of stimulus and function* 

330 ff. 
Interference 

in chaining, 240 

forgetting as, 79 ff. 

in word association, 225 
Interval, see Latency, Reaction time. 

Respondent conditioning 
Intervening responses 

in code learning, 217 ff. 

inner speech, 2i3ff. 

in reaction time, 218 ff. 
Introspection, 371, 379, 389 ff. 



James-Lange theory of emotional 

awareness, 333 ff. 
Joy, 256, 344 ff. 
Jumping apparatus, 137 ff. 



Killing of young, 300 
Kinesthetic stimuli, 211, 324 



Lag, visual, in light aversion, 209 

Lallation, 382 

Language, see Verbal behavior 
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Latency 

in code learning, 217 

cyclic effect in avoidance, 306, 314 

definition, 9 

in discrimination, 145 ff. 

as a measure, 18, 145 

in respondent conditioning, 18 

See also Reaction time 
Laws, see Effect, Least effort, Readi- 
ness 

Leadership, 361 ff. 
Least effort, law of, 177 ff., 196 
Lecture method in teaching, 398 
Lever-pressing apparatus, 43 ff. 
Light aversion, 304 ff. 

holding behavior in, 304 ff. 

See also Avoidance 
Limen, 8 ff., i3off. 

absolute, 8 ff., 130 ff., 133 ff. 

difference, 9, 130 ff., 133 ff. 

measurement of, 131 ff. 
Love, 375 ff. 
Lying, 392 



Magic mand, 385 
Mand, 384 ff. 
Masochism, 308 
Mastery motive, 375 
Maze learning 

analysis of, 228 

apparatus pictured, 57, 58, 206 

as chaining, 228 

T-maze, 58 

temporal maze, 206 

verbalization as aid, Go 
Meaning 

context theory of, 219 ff. 

in verbal behavior, 378, 385, 398 ff. 
Measures 

of acuity, 131 ff. 

of drive, 273 ff., 279 ff., 282 ff., 
288 ff. 

latency, 18, 145 

of maze learning, 57 

rate of pressing, 50 ff. 

reaction time, 143 

of reflex strength, 8 ff. 

time, 37 ff., 54 if., 57 

See also Resistance to extinction 
Mediated generalization, 160 ff. 
Memory, 78 ff. 



Mental activity 

introspection of, 371, 379, 389 ff. 

response interpretation of, 2isff. 
Mental-behavioral dualism, 377, 390 ff. 
Metaphors, 391 

Morse Code, see Code learning 
Motivation (Ch. 9), 262 ff. 

See also Drive 
Motives, social, 373 ff. 

approval, 374 ff. 

identifying by goal, 374 ff. 

love, 375 ff. 

mastery, 375 

nominal nature, 374 ff. 

prestige, 374 ff. 
Muscle sense, 211 

see also Proprioceptive stimuli 



Negative reinforcement, 61 ff., 104 ff., 

113, 303 ff. 
defined, 61 

depressing effects on rate, 328 ff. 
and dominance, 357 
and emotional expression in child, 

336 

as emotional stimulus, 328 ff. 

and fear, 340 ff. 

functions of, 61 ff. 

and law of least effort, 177 ff. 

punishment, 104 ff., 113 

and submission, 357 

and variability, 174 ff. 

See also Anxiety, Aversion, Avoid- 
ance, Conflict, Cooperation 
Nervous system 

autonomic, 5, 27 ff. 

somatic, 5 
Neurosis, experimental, 28 ff., 138 ff. 

audiogenic seizures, 305 
Nonsense syllables, 78 ff. 
Novel responses, see Response differ- 
entiation 

Noxious stimuli, see Negative rein- 
forcement 



Obstruction method, 288 ff. 
apparatus, 288 ff. 
hunger drive measured, 288 ff. 
maternal behavior measured, 299 ff. 
sex drive measured, 297 ff. 
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Oedipus complex, 367 
Oestrus cycle, 295 ff. 
Operant behavior 

defined, 49 ff. 

strength, measure of, 50 ff . 
Operant conditioning (Ch. 3), 36 ff. 

apparatus, 44 ff. 

description, 48 ff. 

and effect, law of, 51 ff. 

of emotional expression, 336 

importance of, 64 ff. 

as measure of discriminative capac- 
ity, 135 

measures of, 50 ff., 72 ff. 

paradigm, 48 

procedure, 45 ff. 

rate as a measure, 50 ff. 

resistance to extinction, 72 ff. 

respondent, compared with, 48 ff., 

5i 

respondent overlap, 62 ff . 
schema, 48 
as Type R, 47 ff. 

Type S, compared with, 48 ff., 51 
of verbal behavior, 384 
Operant level, 76 
and activity drive, 284 
defined, 76 
factors affecting, 284 



Pairs, reinforcement of, 229 

See also Units of response 
Pavlov, 15 ff. 

historical importance of, 33 ff. 
Perception, 150 ff. 

constancies in, 151 ff. 

of depth, 150 ff. 

higher units in, 153 ff. 
Periodic reconditioning, see Periodic 

reinforcement 
Periodic reinforcement (Ch. 4), 83 ff. 

aperiodic reinforcement, 98 ff. 

drive in, 267 ff., 289 

extinction following, 89 ff., 97 ff. 

extinction ratio, 88 

at a fixed interval, 83 ff. 

at a fixed ratio, 91 ff. 

length of interval, 88 

in measuring drive, 289 

and resistance to extinction, 89 ff. 

and resistance to shift, 189 



and response variation, 189 

temporal discrimination in, 86 

units, formation of new, 207 ff. 
Personality, 366 ff. 

consciousness, 370 ff. 

consistency of, 371 ff. 

integration of, 371 ff. 

self, 369 ff. 
Philosophy, impact of behavioral 

analysis on, 399 
Phobia, origin of a, 125 
Phonemes, 380 ff. 
Pleasantness, 350 

Positive reinforcement, 61, see Rein- 
forcement 

Prestige as a motive, 374 ff. 
Private stimuli, see Introspection 
Problem box, 36 ff., 165 ff. 
Projective techniques, see Verbal sum- 
ma tor 
Proprioceptive stimuli, 211 ff., 313, 

324, 349 ff., 389 ff. 
Pseudo-conditioning, 29 
Psychophysics, 9, 131 ff. 
Punishment, 104 ff., 113 

and dominance, 357 

and extinction, 104 ff. 

and submission, 357 

See also Negative reinforcement 
Pursuit-meter, 153 



Rage, 342 

and sorrow, 345 
Rate as a measure, 50 ff. 
Ratio, extinction, 88 
Ratio, fixed, 91 ff. 
Rationalization, 372 
Reaction time, i4off. 

in association, 225 

covert response in, 218 ff. 

intervening response in, 218 ff. 

as a measure, 143 

in personnel selection, 146 ff. 

in word association, 225 

See also Latency 
Readiness, law of, 40 
Receptors, 4 
Reconditioning 

in a chain, 201 ff. 

and variability, 181 ff. 

See also Periodic reinforcement 
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Recovery, spontaneous, 70, 76 ff. 
Redintegration, 152, 391 
Reflex (Ch. i), 4 ff. 

conditioned, 15 ff. 

definition, 4 

fatigue, 12 

history of concept, 6 ff. 

intensity of stimulus as factor, 10 

latency, 9, 18 

limen, 8 ff. 

magnitude of response, 10, 17 ff. 

measures of strength, 8 ff. 

properties, 8 ff. 

salivary, 16 ff. 

threshold, 8 ff. 

time, 22 ff. 

unconditioned, 19, 25 
Reflex arc, 5 ff. 
Reflex strength, 10 ff. 
Regression, 81 ff., 204 
Reinforcement (positive) 

accidental, 103 ff. 

aperiodic, 98 ff. 

in cooperation, 358 ff. 

denned, 61 

delayed, 208 ff. 

and dominance, 357 

of emotional expression in child- 
hood, 336 

at a fixed interval, 83 ff. 

at a fixed ratio, 91 ff., 97, 187 

generalized, 386 

of hearer, 388 

independent of response, 103 ff. 

regular, 46 ff. 

removal of stimuli as, 61 

of respondents, 20 

of self -observation, 349 

of speaker, 387 ff. 

of verbal behavior, 384 

withdrawal of, 342 ff. 

See also Conflict, Negative rein- 
forcement, Periodic reinforce- 
ment, Secondary reinforcement 
Reinforcement, selective, see Differ- 
entiation, Discrimination 
Reinforcement gradient, 208 
Reinforcement history, 366 ff. 
Reinforcing stimulus, 20, see Rein- 
forcement, Negative reinforce- 
ment, Secondary reinforcement 
Releasing mechanisms, 302 



Removal of stimuli as reinforcement, 
see Aversion, Avoidance, Nega- 
tive reinforcement 
Repetition of stimulus word, 393 
Reports, verbal, see Introspection 
Repression, 104 
Resistance to extinction 

amount of reinforcement and, 73 ff. 
following aperiodic reinforcement, 

100 ff. 

criterion of, 72, 76 
following discrimination training, 

i3 

drive and, 74 ff. 
and extinction ratio, 89 
factors affecting, 72 ff. 
force, function of, 196 
as measure of strength, 72 
motivation and, 74 ff. 
number of reinforcements and, 

72 ff. 
following periodic reinforcement, 

8 9 

punishment, effects of, 106 ff. 

strength, measure of, 72 
Respondent behavior (Chs. i, 2), 4 ff. 

defined, 49 

strength, measures of, 8 ff. 

See also Reflex 
Respondent conditioning (Ch. 2), 

15 f- 

apparatus, 16 ff. 

autonomic nervous system, 27 ff. 

defined, 30 

delayed, 21 

discrimination in, iiyff. 

discriminative capacity, measure of, 

134 
of emotion, 346 ff. 

higher-order, 30 ff., 233 

historical importance, 32 ff. 

neural basis, 27 

paradigm, 19, 48 

procedure, 16 ff. 

pseudo-, 29 

punishment, 109 ff. 

of salivary reflex, 16 ff. 

schema, 19, 48 

secondary, 30 ff., 233 

temporal factors, 20 ff. 

trace, 21 

Type R, compared with. 48 ff., 51 
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as Type S, 47 ff. 

to verbal stimuli, 25 ff. 
Response 

definition, 3 

differentiation, 186 ff., tec Differ- 
entiation 

induction, 143 ff., 168 ff. 

magnitude, 10, 17 ff. 

rate, 50 ff. 

Response by analogy, 40 
Response systems in personality, 

372 ff. 

Restraint of response, effects of, 342 ff. 
Retention 

of bar-pressing, 79 ff. 

of nonsense syllables, 78 ff. 
Reward training, 66, see Operant 

conditioning 

Rote learning as chaining, 229 
Running response, see Activity wheel, 

Runway 
Runway, 53 ff. 



Salivary reflex, 16 ff. 
Savings method in memory, 78 
Secondary language, 397 
Secondary reinforcement (Ch. 8), 
231 ff. 

in chaining, 239 ff . 

defined, 232 ff. 

dependence on discrimination 
training, 236 ff., 245 

distinction from discriminative 
stimuli, 238 ff. 

equality with discriminative stimu- 
lus, 237 ff. 

experimental examples, 234 ff. 

factors affecting, 243 ff. 

and feelings, 254 ff. 

generality of, 240 ff., 386 ff. 

masochism, 308 

and motives, 375 

negative, 249 ff., see Anxiety, Avoid- 
ance 

number of pairings as factor, 243 

parameters, 243 ff. 

in personal adjustment, 259 

presentation of, 345 ff. 

removal of, 345 

of respondents, 233 

in social behavior. 257 ff. 



and social motives, 375 

and sorrow, 345 

sub-goals, 247 ff. 

temporal relation to primary, 243 ff. 

token rewards, 247 ff. 

of Type S reactions, 233 

See also Chaining, Discriminative 

stimuli, Anxiety, Avoidance 
Secondary reward training, 66, see 

Secondary reinforcement 
Selective reinforcement, see Differ- 
entiation, Discrimination 
Self 

discrimination of, 369 ff. 

observation of, 348 ff. 

report of, 371 ff. 

See also Introspection 
Semantics, see Meaning 
Sense organs, 4 
Set, 239 ff. 
Sex drive, 294 ff. 

abnormal objects of, 302 

activity wheel studies, 295 ff. 

effects of castration, 301 

homosexuality, 302 

in human, 300 ff. 

and love, 375 

obstruction method studies, 297 ff. 

oestrus cycle, 295 ff. 

slips of tongue, 303 

symbolism, 303 
Shock aversion, 306 ff. 

See also Avoidance 
Similarity, meaning of, 123 ff. 
Slap, as punishment, 106 ff. 
Slips of tongue, 303 
Slot machine, 188 ff., 247 ff. 
Social behavior (Ch. 11), 352 ff. 

in animals, 353 ff. 

consciousness, 370 

cooperation, 357 ff., 399 

cultural environment, 363 ff., 368 

dominance, 356 ff. 

emotional expression, 335 ff. 

history of reinforcement, 366 ff. 

imitation, 361 ff. 

leadership, 361 ff. 

motives, 373 ff. 

personality, 366 ff., 371 ff. 

readings, 399 

reinforcement-history, 366 ff. 

self, 369 ff. 
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stimuli, 352 ff. 

submission, 356 ff. 

See also Verbal behavior 
Social motives, 373 ff. 

approval, 374 ff. 

identification with goal, 374 ff. 

love, 375 ff. 

mastery, 375 

prestige, 374 ff. 
Sorrow, 256, 344 ff. 
Sound aversion, 305 ff. 

Audiogenic seizures, 305 

Cyclic latencies in, 306 
Span of apprehension, 154 
Speech, see Verbal behavior 
Spontaneous activity, see Activity 

drive, Operant level 
Spontaneous recovery 

operant, 76 ff. 

respondent, 70 

time, as function of, 77 ff. 
State, 

drive as a, 269 ff. 

emotion as a, 331 ff. 
Stereotypy, 165 ff. 

in shock aversion, 307 
Stimuli 

compound, 23 ff. 

conditioned, 19, 25 

definition, 3 

discriminative, 118 

accompanying drive, 276 ff., 286 ff. 

eliciting, 19, 25 

intensity, 10 

latency, 9 

reinforcing, 20 

social, 352 ff. 

threshold, 8 ff., 130, 133 

unconditioned, 19 

See also Anxiety, Aversion, Avoid- 
ance, Discrimination, Generaliza- 
tion, Negative reinforcement, 
Proprioceptive stimuli, Reinforce- 
ment, Secondary reinforcement 
Stomach contractions, 285 ff. 
Strength of response, see Force, Re- 
flex, Resistance to extinction 
Sub-goals, 247 ff. 
Sub-hungers, 272, 291 ff. 

in cats, 291 ff. 

in infants, 292 ff. 



Subjective data, see Introspection 

Submission, 356 ff. 

Successive approximation, 190, see 
Differentiation 

Summator, verbal, 394 ff. 

Superficial responses in word associa- 
tion, 224 

Superstition, 102 ff., 209 ff. 



T-box, 138 
T-maze, 58 
Tact, 385 ff. 

distortion of, 391 ff. 
Teaching 

discussion method, 398 

lecture method, 398 

as response differentiation, 190 ff. 
Temporal discrimination, 86 
Temporal maze, 206 
Terror, see Fear 
Thinking, 213 ff. 
Thorndike, 36 ff., 104 ff. 

criticisms of, 41 
Threshold, 

absolute, 8 ff., 130, 133 

difference, 8 ff., 130 

measurement of, 131 ff. 
Time as a measure, 37 ff., 54 ff., 57 
Time discrimination, 21 ff., 86 
Time reflex, 22 ff. 
Token rewards, 247 ff., see Secondary 

reinforcement 
Topography 

variations in, 169 ff., 195 ff. 

differentiation of, 190 
Trace reflex, 21 
Transfer of training, 169 ff. 
Transposition, 158 ff. 
Trial-and-error learning, 36 ff., 66, 

see also Operant conditioning 
Triple-pressings, reinforcement of, 
203 ff., see also Units of response 
Type R conditioning, 47 ff., see also 

Operant conditioning 
Type S conditioning, 47 ff., see also 
Respondent conditioning 



Unconditioned stimulus, 19, see also 

Respondent conditioning 
Understanding, 397 ff. 
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Units of response, 200 ff., 203 ff., 
207 ff., 229 



Vacillation, 320, 322, 323 ff. 
Variability of response (Ch. 6), 
165 ff., 195 ff. 

drive and, 179 ff. 

in extinction, 181 ff. 

least effort, law of, 177 ff. 

following reconditioning, 181 ff. 
Verbal behavior (Ch. 11), 376 ff. 

association experiment, 221 ff. 

audible, 396 

audience, 394 

babbling, 382 

chaining in, 214 ff. 

in children, 193 ff., 380 ff., 384, 396 

community, verbal, 383 ff. 

consciousness, 370 ff ., 397 

consistency with other behavior, 
371 ff. 

in deaf children, 383 

development of, 380 ff. 

distortions of tacts, 391 ff. 

drive and, 290, 394 ff. 

dualism, mental-behavioral, 377 

echoic responses, 393 

echolalia, 382 

exaggeration, 392 

explanations, circular, 377 

feelings, 350 

hearer, 387 ff. 

introspection, 379 

lallation, 382 



listener, 387 ff. 

lying, 392 

mand, 384 ff. 

meaning, 378, 398 ff. 

mental events, 379 

metaphors, 391 

and philosophy, 399 

phonemes, 380 ff. 

private stimuli in, 349 ff. 

report of emotion, 349 ff. 

secondary language, 397 

self, discrimination of, 369 ff. 

slips of tongue, 303 

stages in, 382 

summator, verbal, 394 ff. 

tact, 385 ff., 391 ff. 

textual factors in, 393 

theory, general, 378 ff. 

understanding, 397 ff. 

vocal, reasons why, 395 ff. 

word association, 221 ff. 
Verbal reports 

of emotion, 333, 349 ff. 

See also Introspection 
Verbal summator, 394 ff. 
Visual lag in light aversion, 209 
Vocal, reasons verbal behavior is, 

395 ft- 

Voluntary control in respondent con- 
ditioning, 25 ff. 



Watson, 338, 347, 379 
Word association, 221 ff., sec Associa- 
tion 



